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ABSTRACT 


The spread of plant virus disease between fields is reviewed for 
several horticultural and agricultural crops. Previous studies have 
focused on cropping systems where there is scope for using separation 
in time and space to reduce the potential for virus spread between 
plantings. In our study, data are presented on virus disease spread 
between fields in an irrigated rice area in the tropics where crops are 
grown continuously throughout the year. An intensive survey was 
conducted in rice fields planted from November 1992 to October 1994 
in which the occurrence of new infections of rice tungro virus disease 
was recorded. The effect of a number of variables on disease incidence, 


1 Copyright 2006, Elsevier Inc. 
All rights reserved. 

0065-3527/06 $35.00 

DOI: 10.1016/S0065-3527(06)66001-6 


2 T. C. B. CHANCELLOR ET AL. 


including the proximity of a field to inoculum sources, was examined 
using ordinal regression analysis. Primary infection showed large 
seasonal fluctuation. In addition, the number of leafhopper vectors 
had a significant effect, as did the tungro vector resistance of the rice 
variety grown. The distance to nearest inoculum source and the tungro 
incidence of this source significantly affected the level of infection 
occurring in a vulnerable field. The results are discussed in relation 
to management strategies to reduce the spread of tungro disease 
between fields in irrigated lowland rice cropping systems. 


I. INTRODUCTION 


In this chapter we begin by reviewing approaches to investigating 
the spread of plant virus diseases, with a particular focus on virus 
spread between fields. We then present a case study on rice tungro 
virus disease in the tropics. The aim of the research that is described 
was to identify risk factors associated with the spread of rice tungro 
virus disease to new plantings and to use the information to develop 
improved disease management practices. 


A. Economic Importance of Plant Virus Diseases 


Most economically important crop plants are prone to infection with 
viruses, which result in a reduction in yield or in quality (Walkey, 1985). 
The extent of the economic loss may vary greatly and depends on a range 
of factors. Although the impacts of virus diseases are seen primarily in 
reductions in crop yield and quality and in effects on market prices, 
significant economic costs also arise through the need to take preventive 
measures or to implement control strategies. In developing countries 
harmful impacts of plant virus diseases may be particularly severe 
because reduced crop yields can seriously affect the livelihoods of poor 
people and even threaten food security. The epidemic of cassava mosaic 
disease in Uganda in the 1990s led to starvation in some districts in the 
country and is estimated to have resulted in a loss of US$60 million per 
annum during the height of the epidemic (Thresh and Cooter, 2005). 

Virus disease epidemics in perennial crops may be especially serious. 
As it is not usually possible to eliminate the virus infection, crops will 
remain diseased throughout their life. High-value trees infected with 
virus may have to be removed to prevent spread to neighboring 
healthy trees and it may take several years before replacement trees 
become productive. The serious economic consequences that result 
may be seen in the outbreaks of citrus tristeza disease, which have 


SPREAD OF TUNGRO DISEASE TO NEW PLANTINGS 3 


occurred in the Americas (Wutscher, 1977). Crops that are propagated 
vegetatively present particular challenges for virus disease control as 
spread can occur very rapidly through infected planting material. If 
adequate precautions are not taken, spread of viruses through human 
movement of infective vegetative propagules can take place over much 
longer distances than would be possible through natural means 
(Thresh, 1986). 

Oubreaks of virus diseases in annual crops can develop extremely 
rapidly and cause total yield loss in a particular growing season. As with 
vegetative propagules, transfer of infected seed can result in the intro- 
duction of virus disease to new areas. Seed-borne infection leads to early 
onset of disease and therefore creates the potential for significant epi- 
demics to arise in new plantings, assuming that the virus has other 
modes of spread from plant to plant. However, rapid virus disease 
spread may also occur where there are no introduced foci of infection 
through infected planting material. Nonpersistent viruses that are 
transmitted by arthropod vectors can be spread very quickly within a 
crop due to the short feeding periods required for virus acquisition and 
inoculation. Semipersistent viruses are transmitted less rapidly than 
nonpersistent viruses, but they are retained for longer periods in the 
vector and can potentially be carried over longer distances to initiate 
new infections. Semipersistent viruses, such as sugar beet yellows, 
continue to cause serious losses in sugar beet crops in Europe and North 
America. Heathcote (1978) estimated that annual losses incurred by 
sugar beet growers due to the disease are in the region of £4.2 million. 


B. Spread of Plant Virus Disease Within Plantings 


Considerable attention has been devoted by plant virus epidemiolo- 
gists to assessing the spread of virus diseases within crop plantings 
(Campbell and Madden, 1990). The analysis of patterns of diseased 
plants within crops can provide important information about how 
disease is spread. For example, where virus disease is spread by mobile 
arthropod vectors, a random distribution of diseased plants is consis- 
tent with the introduction of primary inoculum by viruliferous vectors 
(Thresh, 1976). Random distributions of virus-diseased plants may 
also be characteristic of seed-borne infection. By contrast, aggregation 
of diseased plants in close proximity to each other suggests secondary, 
plant-to-plant spread of disease (Madden et al., 1982). For virus dis- 
eases that are transmitted mechanically, this may arise through cul- 
tural practices such as weeding or through damage caused by extreme 
weather conditions. Secondary virus disease spread may also arise 
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through the movement of infective arthropod vectors, particularly for 
nonpersistent and semipersistent viruses. 

Many statistical tools are now available to analyze spatial patterns 
of diseased plants in crops and in natural populations. These include 
quadrat-based approaches, adapted from plant ecology, and geostatis- 
tical methods, such as “kriging,” which was developed for geological 
surveys (Gottwald et al., 1996). Perhaps the most commonly used 
statistical tools currently used are those based on distance class meth- 
ods. Distances between individuals in a plant population are measured 
and analyzed in programs such as 2DCLASS. This type of analysis can 
provide information about the size and shape of individual clusters of 
diseased plants, the number of clusters, and the direction between 
clusters (Nelson et al., 1992). 


C. Spread of Plant Virus Diseases Between Fields 
1. Long Distance Virus Disease Spread 


Most vector-borne plant virus diseases spread over relatively short 
distances and disease gradients from known sources of infection have 
been documented over meters or tens of meters (Thresh, 1976). There 
are pathosystems where virus spreads over considerably greater dis- 
tances and these tend to be for viruses that persist for longer in the 
vector. The classic example is beet curly top in the USA, transmitted 
persistently by the leafhopper Circulifer tenellus (Bennett, 1971). 
Wind-assisted flights of C. tenellus from temporary weed hosts in 
California and neighboring states have been recorded over distances of 
over 398 miles (Bennett, 1967). This results in very shallow gradients 
of Beet curly top virus over hundreds of miles. 

It is usually difficult to attribute virus outbreaks to specific sources 
of inoculum. Only rarely has it been possible to track the spread of 
disease from a known source over long distances. One such example is 
a series of experiments on a fungal pathogen, wheat stem rust, con- 
ducted in the 1950s on St. Croix in the Virgin Islands (Kingsolver et al., 
1984). In one experiment, five small fields of wheat were sown at 
successive 3.3 km intervals downwind from a 2.43 ha source of dis- 
eased wheat. The results showed that wind-borne inoculum from the 
disease source initiated destructive epidemics in test fields up to 15 km 
distant. The study was made possible because there were no natural 
sources of leaf or stem rust of wheat on, or upwind of, the island, and 
isolation from wheat-growing regions allowed the study of virulent 
stem rust races in the experiment. 
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2. Field-to-Field Spread of Virus Diseases 


There are no comparable experiments for plant viruses, but there 
have been a few studies of virus dispersal between fields, which have 
led to the development of improved disease management practices. The 
most detailed study was carried out on the spread of lettuce mosaic 
disease between lettuce crops on two farms in the United Kingdom 
during 1947-1950 (Broadbent et al., 1951). Lettuce mosaic disease is 
seed-borne and so the production of virus-free seed is a key component 
of any strategy to control the disease. However, the disease is also 
spread nonpersistently by aphids, the most important of which is Myzus 
persicae. The aim of the studies conducted by Broadbent et al. (1951) 
was to examine the effects of isolation, field size, and crop sanitation on 
the spread of lettuce mosaic to newly planted lettuce fields. 

Patterns of disease spread on two farms with contrasting manage- 
ment practices were compared. On one farm, disease spread from crop 
to crop was reduced by planting or drilling in large blocks, well sepa- 
rated from each other and by not planting a winter crop. By contrast, 
overwintering lettuce was present on the second farm and served as an 
infection source for subsequent spring sowings of the crop. Further- 
more, no attempt was made on this farm to isolate successive lettuce 
crops and substantially higher disease incidence resulted. The authors 
concluded that, in addition to the use of virus-free seed, growers should 
cultivate lettuce fields in discrete blocks rather than allowing them to be 
scattered throughout the farm. They also recommended that the resi- 
dues of lettuce crops should be ploughed in as soon as possible after 
harvest. Crop residue management is also important for other diseases. 
Cauliflower mosaic disease, for example, can usually be controlled by a 
combination of insecticides and physical isolation. However, Garrett 
and Mclean (1983) illustrated an unusual case of high-disease incidence 
in brassica fields in Australia, which was due to the presence of a nearby 
previously infected crop that had not been removed. 

In the United Kingdom, beet yellows and beet mosaic viruses are 
prevalent in areas where seed crops are grown. The incidence of both 
diseases increases with increasing numbers of the main vector Myzus 
persicae, but there is an interesting contrast in the relationship 
between the two diseases and proximity to individual seed crops. 
Surveys revealed that there was no clear relationship between dis- 
tance from a seed crop and the level of infection with Beet yellows virus 
(Watson et al., 1951). However, the same surveys showed that distance 
from the nearest seed crop had a clear effect on the prevalence of 
infection with Beet mosaic virus, although the disease was largely 
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restricted to fields within 100 yards of a seed crop. The difference was 
attributed to the differing transmission characteristics of the two dis- 
eases. Beet yellows is a persistent virus and, as aphids remain infec- 
tive for longer periods, there is a greater likelihood that aphids will 
be carried over relatively long distances from sources of infection. 
By contrast, beet mosaic is a nonpersistent virus and is lost rapidly 
by vectors, especially during feeding. 


3. Geographic Information Systems and Geostatistics 


Some disease systems are highly complex and involve multiple 
vectors and viruses. These present particular challenges for epidemio- 
logical studies, but approaches have been developed based on the 
principles of landscape ecology and utilizing geographic information 
systems and geostatistics (Coulson, 1992). The first published example 
of the use of such approaches for virus diseases was the attempt to 
develop a regional management plan for a complex of tomato diseases 
in the Del Fuerte Valley in Sinaloa, Mexico (Nelson et al., 1994). With 
the exception of Tomato mosaic virus, each of the viruses involved is 
transmitted by a vector (aphid, whitefly, or thrips) and has alternative 
hosts among weeds and other crop plants in the area. The existence of 
these common characteristics provided the rationale for developing 
general virus risk assessments rather than focusing on individual 
virus diseases. 

In the Sinaloa study, field surveys were used to generate risk assess- 
ments for individual fields. The risk assessments were based on data 
collected on the abundance of vector and virus sources within and 
adjacent to the fields planted with tomatoes. The fields were located 
on digitized topographic maps (1:50,000), and coordinates for the cen- 
ter of these fields were then estimated. This method was later supple- 
mented with the use of Global Positioning System units that were able 
to obtain map coordinates to a resolution of about 100 m. Geostatistics 
were then used to analyze disease risk and disease incidence data at a 
regional level. 

Initial findings indicated that both disease risk and incidence were 
spatially dependent variables with a variogram range of 20-25 km 
(Nelson et al., 1994). There was also a correlation between risk assess- 
ment scores and virus disease incidence. The management strategy 
adopted by the project was to apply traditional virus management 
techniques such as removing sources of virus and vectors and facilitat- 
ing the supply of healthy seed (Barnes et al., 1999). These measures 
were focused on the areas of highest risk and led to a gradual reduction 
in the incidence of tomato virus diseases from a peak of over 7% in 
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1991-1992 to less than 1% in 1995-1996. The spatial analysis was not 
needed to implement the virus control measures, but provided a frame- 
work for regional cooperation and information exchange. For example, 
commercial growers producing tomatoes for processing used the infor- 
mation to avoid areas of high-virus disease incidence in previous years. 

Each of the examples discussed earlier involve annual horticultural 
and root crops with distinct cropping seasons. Cropping patterns in 
time and space can be manipulated, at least to some extent, in these 
agricultural systems. In extensive monocultures where continuous 
cropping is practiced, the range of possible management interventions 
is necessarily more limited. In Asia, large areas of continuous rice are 
grown and in some low-lying locations where drainage is restricted 
there are no alternative cropping options for much of the year. Conse- 
quently, the impact of virus diseases in these lowland rice systems can 
be especially serious. 


II. Rice Tuncro DISEASE 


A. The Economic Importance of Rice 


Rice is the main staple food for more than half of the world’s popula- 
tion, including seventeen countries in Asia and the Pacific. In Asia, it 
has been estimated that 2 billion people receive 60-70% of the calories 
in their daily diet from rice (FAO, 2003). There are about 840 million 
undernourished people worldwide, including 200 million children, and 
increased rice production has the potential to significantly reduce 
these numbers. In addition to its importance as a food source, rice 
provides employment in production and postharvest activities for large 
numbers of rural and urban dwellers. Rice-based systems include 
much of the biodiversity of plants, insects, and other organisms in 
large areas of tropical Asia. Insect pest management depends to a 
large extent on predation and parasitism by natural enemies that are 
present in these ecosystems. 


B. The Economic Threat of Rice Virus Disease 


In view of the crucial importance of rice to the livelihoods of people 
in many countries, any major biotic constraint to rice production 
has potentially devastating consequences. Following a large-scale 
expansion of irrigation schemes during the 1960s and 1970s, irrigated 
areas in Asia comprise about 49% of the cultivated land and account 
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for 72% of total production (IRRI, 1989). These areas saw the introduc- 
tion of modern semidwarf high-yielding varieties, which were respon- 
sive to inorganic fertilizers and required good water management to 
realize their full potential. The new rice varieties were not sensitive to 
photoperiod and so could be planted throughout the year where condi- 
tions were suitable. These varieties also matured considerably earlier 
than traditional varieties and local landraces. Consequently, it was 
possible to plant rice continuously and to harvest up to five crops 
within a 2-year period. This dramatic intensification of rice production 
systems resulted in the large increase in yields, which came to be 
characterized as the “Green Revolution.” 

The new systems of rice cultivation led to greatly increased produc- 
tion but they were also vulnerable to a heightened risk of certain pests 
and diseases. During the 1960s a virus disease emerged as a major 
threat to rice production in irrigated lowland ecosystems in Asia. 
There were several large-scale outbreaks of rice tungro virus disease 
in the 1960s and 1970s. These led to severe disruption of rice produc- 
tion in the areas affected and resulted in significant hardship for 
farming families (Thresh, 1989). Further outbreaks arose in the 
1980s and 1990s, but these decreased in frequency and, with one or 
two exceptions, tended to be confined to smaller areas. Although 
tungro was first described in 1965 (Rivera and Ou, 1965), it is probable 
that it was present in rice crops for much longer. This view is sup- 
ported by the fact that there are local names for the disease in several 
countries in South and Southeast Asia (Thresh, 1989). Tungro disease 
emerged as a major problem because the new irrigated production 
systems created the ideal conditions for its persistence and spread. 


C. Biology of Rice Tungro Disease 


Tungro is associated with two leafhopper-transmitted viruses, 
which interact to allow disease development (Hibino, 1983). Rice tun- 
gro spherical virus (RTSV) is an RNA virus in the family Sequiviridae. 
RTSV can be transmitted independently by leaf hopper vectors but does 
not cause visible symptoms or yield loss in most rice varieties. Howev- 
er, RTSV acts a helper virus for the transmission of a DNA/RNA para- 
retrovirus in the family Caulimoviridae. This second virus, Rice tungro 
bacciliform virus (RTBV) is responsible for the symptoms that intensify 
in rice plants with dual infections. Tungro symptoms vary depending 
on the age of the plant, the susceptibility of the variety, and the strains 
of the viruses (Azzam and Chancellor, 2002). Early infections in a 


SPREAD OF TUNGRO DISEASE TO NEW PLANTINGS 9 


susceptible variety lead to severe stunting, limited tiller development, 
and the characteristic yellow to orange leaf discoloration. 

Tungro viruses are transmitted by six leafhopper species, the most 
important of which is the rice green leafhopper Nephotettix virescens 
(Distant). N. virescens transmits tungro viruses more efficiently than 
the other vector species and is usually more abundant in irrigated rice 
fields (Hibino and Cabunagan, 1986). The viruses are transmitted in a 
semipersistent manner by the leaf hopper vectors, which remain infec- 
tive for periods up to 7 days. RTSV can be transmitted independently, 
whereas RTBV may only be spread after leafhoppers feed on rice 
plants with dual infections or when they acquire RTSV first and then 
RTBV (Cabauatan and Hibino, 1988). Rice plants can become infective 
and begin to develop symptoms within 1 week to 10 days of infection 
(Narayanasamy, 1972). 

A wide range of nonrice plants have been reported as hosts of tungro 
viruses based on symptoms or on serology (Khan et al., 1991). However, 
weeds and other nonrice hosts are unlikely to be major sources of virus 
inoculum, with the possible exception of some wild rices in certain 
locations (Tiongco et al., 1993). Leaf hopper recovery of tungro viruses 
from weeds to rice has not been definitively shown in controlled experi- 
ments, especially for N. virescens. Rice seedbeds are not generally 
potential sources of infection, except in areas where seedlings remain 
in nurseries for significantly longer than the normal period of 3 weeks 
(Bottenberg et al., 1990; Tiongco et al., 1993). Regrowth from rice stub- 
bles and volunteer rice plants may act as virus sources, but in areas 
where tungro is endemic rice plants in standing crops are likely to be 
much the most potent inoculum sources. 


D. Within-Field Spatial Patterns of Tungro Disease Spread 


In view of the short feeding periods needed for leaf hopper vectors to 
acquire and transmit the virus and the short latent period of infection 
in rice plants, the potential for rapid spread of tungro within rice crops 
is very high. Kondaiah et al. (1976) designed a circular planting tech- 
nique to examine the plant-to-plant spread of tungro from a central 
inoculum source. Clear gradients of infection were recorded with 
decreasing levels of disease incidence as the distance from the source 
increased. Subsequent experiments conducted in the Philippines 
investigated the role and relative importance of tungro spread into 
and within rice plantings (Satapathy et al., 1997). The results from a 
series of experiments using introduced sources of inoculum showed 
clear spatial patterns. Even in seasons when there were several other 
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disease sources in the area, it was possible to detect the effects of the 
introduced sources through observed disease gradients (Satapathy 
et al., 1997). This suggests that most of the disease spread was second- 
ary, plant-to-plant spread by leaf hoppers that acquired tungro viruses 
from sources within the plots. The experiments also showed that gra- 
dients in a resistant variety were much steeper than those in a suscep- 
tible variety, indicating the potential of resistant varieties to reduce 
secondary spread. 

In other field trials conducted in the Philippines, seasonal patterns 
of leafhopper immigration and population development were exam- 
ined in relation to temporal and spatial patterns of disease incidence 
(Chancellor et al., 1996). Leafhopper immigration into rice plots oc- 
curred during the early stages of crop growth in the rainy season and 
also in a late-planted dry season crop. Early disease infections were 
randomly distributed and clusters of diseased plants emerged as tun- 
gro disease developed, indicating that secondary spread had taken 
place within the plantings. Although tungro disease incidence and 
leafhopper abundance tended to be higher during rainy seasons, there 
was no clear relationship between vector numbers and disease levels. 
The results from this and other studies indicate that leaf hopper vector 
numbers alone are not always an accurate predictor of tungro disease 
risk and must be viewed in combination with the prevailing levels of 
inoculum (Savary et al., 1993). The findings also suggested that there 
was limited scope for the successful deployment of control measures 
aimed at reducing leaf hopper numbers or plant-to-plant spread within 
individual fields. Tungro disease can spread rapidly even when immi- 
gration rates of vectors are low and the use of insecticides to control 
vectors, or of roguing to remove diseased plants, are usually not effi- 
cient enough to have significant impact on the disease (Holt and 
Chancellor, 1996; Tiongco et al., 1998). 


E. Flight Characteristics of Tungro Vectors 


The experiments conducted in the Philippines showed that rapid 
spread of tungro disease within plantings can occur when conditions 
are favorable. However, little information is available about tungro 
disease spread between plantings. The distance over which such 
spread occurs is important in the assessment of tungro risk and for 
the effective targeting of disease management strategies. Although 
there is evidence to suggest that leafhoppers may undertake flights 
of up to at least 30 km (Riley et al., 1987), results from tethered flight 
studies suggest that only a small proportion of a given population 
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makes such long distance flights (Cooter et al., 2000). In an asynchro- 
nously planted area in Bali, Indonesia, where fields are small in size 
and where there are few extensive areas of rice production, most 
leaf hopper dispersal is thought to occur over distances of up to 2 km 
(Suzuki, Y., personal communication). It is not clear whether leaf hop- 
per movement commonly occurs over similar distances in larger rice 
basins, but some circumstantial evidence is available from light trap 
data collected in Central Luzon in the Philippines (Loevinsohn, 1984). 
This suggested that the common dispersal range of tungro vectors was 
about 1 km. 


III. AREA-WIDE INVESTIGATION OF RICE TUNGRO DISEASE INCIDENCE 


In order to examine the main factors influencing temporal and 
spatial patterns of tungro disease in an endemic area, an intensive 
survey was conducted over a 2-year period in the Philippines. We now 
describe the spatial patterns of tungro disease incidence that were 
found in the survey area and examine the main factors that affected 
the risk of primary infection in a field. 


A. Tungro Disease Study Site 


The study site was located at 13°18’ N, 123°18’ E in a tungro- 
endemic area in the province of Albay, Luzon Island, Philippines where 
rice is grown throughout the year. Rice seedlings are raised in seed- 
beds and transplanted into puddled fields at 21-30 days after sowing. 
Irrigation water from the Kinale and San Francisco rivers is generally 
available throughout the year, enabling two or more rice crops to be 
planted annually. Some areas in the study site are poorly drained, 
causing problems with water control during periods of heavy rainfall 
that could occur between the months of June and November. The driest 
period is from January to April when the mean monthly rainfall is 
generally less than 100 mm. 


B. Field Surveys 


Surveys were conducted between January 1993 and December 1994 
in approximately 170 fields in a contiguous area of 150 ha in the 
barangays (villages) of Kinale and Balangibang. From January to 
May 1993, numbers of leaf hopper vectors and tungro disease incidence 
were recorded every 3—4 weeks in rice crops up to panicle emergence at 
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approximately 60 days after transplanting (DAT). The data collection 
was done once a week from June 1993 until the end of the survey 
period. The planting date for each rice crop was also recorded and the 
variety was determined by interviewing the farmer. Varieties with 
resistance to leafhoppers and which had “field” resistance to tungro 
disease were identified based on the classification adopted by Philippine 
Rice Research Institute, and all other varieties were classified as 
susceptible. None of the varieties grown had virus resistance. 

Adults and nymphs of leafhopper vectors were estimated by ten 
sweeps of a 30 cm diameter sweep net. Insects were counted directly 
after collection in the field. Tungro incidence was assessed by counting 
the number of diseased hills (clump of rice plants) in five randomly 
selected quadrats of 100 hills in each field. A hill was considered 
diseased if any tiller within it showed the characteristic symptoms of 
yellow to orange discoloration, twisting of the leaves, and stunting 
of the plant (Ou, 1985). The presence of tungro was confirmed by 
indexing a few leaf samples with enzyme-linked immunological assay 
(Bajet et al., 1985). 


C. Data Analysis 


For each observation date, two categories of rice field were identified: 
those vulnerable to tungro infection and those that could act as infec- 
tion sources. There is a delay between a plant becoming infected and 
the symptoms of the disease becoming clearly identifiable. Thus the 
incidence of new infection must be related to the presence of disease 
sources at an earlier time. This interval used was defined partly by the 
latent period of the disease, and partly by labor constraints on the 
survey protocol. The longest period between sampling dates was 
4 weeks, but this occurred on only three occasions. Except in these 
cases, observations were made at least every 3 weeks throughout the 
study, a period which also allows time for symptoms to become appar- 
ent. Therefore, a 3-week interval was used to relate new infections to 
prior disease sources. 

“Vulnerable” fields were defined as those which were either at 
growth stage tillering or booting, and which also had no symptoms of 
infection 3 weeks earlier (or as close to 3 weeks as observation dates 
allowed). Vulnerable fields thus fall into two categories, those that 
still do not show symptoms of infection and those that do. If a field 
remains uninfected and within the appropriate growth stage, it may be 
recorded over successive observation dates as being vulnerable and 
uninfected. Once a field became infected, it was no longer included 


SPREAD OF TUNGRO DISEASE TO NEW PLANTINGS 13 


when the set of vulnerable fields was next established. Three cate- 
gories of disease incidence were distinguished; fields in which the 
incidence of tungro was zero, <1%, and >1%, respectively. In fields 
with tungro incidence >1%, it was possible to make a quantitative 
estimate of the proportion of plants infected. The occurrence of new 
(primary) infection in a vulnerable field was related to a number of 
possible explanatory variables, including the proximity of the field to 
potential sources of tungro infection. 

An infection source was defined as a field with the rice at the 
tillering or booting stage and in which tungro incidence was >1%. 
Parcelliary maps at a scale of 1:2500, and which showed individual 
field boundaries, were obtained from the Bureau of Land and Soils. 
The maps were digitized using the PC ARC/INFO version 3.4.2 (Envi- 
ronmental Systems Research Institute Inc., New York) geographical 
information systems program, which allowed the coordinates of the 
centroid of each field to be calculated. The distances between the 
centroids of vulnerable fields and sources were calculated using simple 
geometry. Clearly, several fields could act as sources for one vulnerable 
field and our aim was to determine whether tungro infection in a 
vulnerable field was related to the proximity of sources. 

Other variables of interest, which were thought likely to have an 
impact on primary infection, were included in the regression ana- 
lysis. These were the number of vectors recorded in both the vulnera- 
ble field (at the time) and source fields (3 weeks earlier), the variety 
type (resistant or susceptible) of both the vulnerable field and the 
source fields, and the percentage disease incidence recorded in the 
source fields. A further factor, Period, was also included in the analysis 
so that the contribution of variables could be established after seasonal 
fluctuation had been accounted for. Thus, Period accounted for any 
differences in disease incidence that were simply due to variation 
between sampling periods. 


D. Results 


The number of fields into which rice was transplanted in different 
months of the year is shown for the period November 1992 to October 
2004 (Fig. 1). The pattern of planting was broadly similar for leaf hopper- 
resistant and susceptible varieties. There was a marked contrast in 
planting pattern between the first and second years of the study. During 
the initial 12-month period, substantial numbers of fields were planted in 
each month. There were peaks in the number of fields planted in Novem- 
ber 1992—January 1993 and June—August 1993, corresponding to the dry 
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Fic 1. Frequency of planting and mean peak tungro disease incidence for rice fields 
with susceptible (dark shading) and resistant (light shading) varieties in Albay Province, 
Philippines between November 1992 and October 1994. 


and wet seasons, respectively. However, there was no clear break be- 
tween the two cropping seasons. By contrast, planting dates in the second 
year were more closely synchronized and most fields were planted in 
December 1993—January 1994 and in July 1994. Consequently, there 
was a distinct gap between the dry and wet season plantings, with only 
one field planted in April and none in May. As a result, there were very 
few fields of standing rice when the bulk of the new plantings occurred in 
July. 

Tungro disease incidence was highest during the first year of the 
study. Peaks in disease incidence were generally observed during the 
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later plantings within the main cropping seasons: March (1993 and 1994; 
dry seasons) and August-September (1993; wet season). Tungro disease 
levels in the 1994 wet season were very low and no peak in incidence 
could be detected. In each year, mean peak tungro disease incidence was 
higher on susceptible than on resistant varieties. For example, mean 
peak incidence on susceptible varieties planted in August 1993 was 
22%. This compares with an equivalent figure of 5% for resistant vari- 
eties. Similarly, 13% of rice hills in susceptible varieties planted in March 
1994 were affected by tungro, whereas only 5% were affected on resistant 
varieties. 

The period selected for more detailed data analysis covered the 
November 1992—October 1993 plantings. This period was chosen 
because it included the largest number of new infections and source 
fields. A total of eleven sample dates were examined in the analysis 
(Fig. 2). Periods 5 and 7 were omitted because no new infections 
occurred and Period 8 was omitted because there were no disease 
sources. There were distinct differences over time in the distribution 
of tungro infection in the set of vulnerable fields. The first three sample 
periods, corresponding to the early to middle part of the dry season, 
had low-disease incidence. The proportion of vulnerable fields in the 
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Fic 2. The percentage of fields in each infection category and the total number of fields 
vulnerable to new tungro infection over successive periods. 
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“infected” (>1% tungro incidence) and “trace” (<1% tungro incidence) 
categories increased at sample Period 6, at the end of the dry season. 
Infection was greatest during the final four sampling periods, which 
corresponded to the middle to later part of the rainy season. 

The pattern of tungro infection over time was strongly associated 
with the average distance to an infection source (Fig. 3). The percent- 
age of rice fields with zero infection was generally lowest during per- 
iods when the mean distance to the nearest source was shortest. There 
were no obvious temporal trends in the cultivation of resistant or 
susceptible varieties, with the percentage of susceptible varieties rang- 
ing from 27% to 37% over the 11 sampling periods. Similarly, while 
leafhopper vector abundance in the vulnerable fields fluctuated over 
time, there was no clearly discernible pattern linking it to seasonal 
fluctuation in infection. 

In order to investigate the effect of distance to sources and other 
variables on the risk of a vulnerable field becoming infected, the effect 
of Period was first included in the regression. Other variables making 
an additional significant contribution to the model therefore have an 
effect in addition to any caused by the seasonal fluctuation. The results 
of the regression analysis are shown in Table I. As expected, there was 
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Fic 3. Period averages for tungro-vulnerable fields: the percentage healthy (i.e., 
showing zero tungro infection), the percentage of a susceptible variety, the distance to 
a tungro source, and the vector abundance. 
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TABLE I 


ORDINAL REGRESSION MODEL FOR TUNGRO INCIDENCE CATEGORIES: ZERO, TRACE, AND INFECTED, 
802 D.F., Deviance Ratio 20, APPROXIMATE x? P < 0.001 


Variable Estimate S.E t pt) 

Period 9 0.825 0.278 2.97 0.003 
Period 10 0.935 0.270 3.47 <0.001 
Period 12 1.987 0.309 6.43 <0.001 
Period 13 1.236 0.372 3.32 <0.001 
Period 14 1.351 0.429 3.15 0.002 
Period 15 1.251 0.342 3.66 <0.001 
Distance to next-nearest source —0.001731 0.000364 —4.76 <0.001 
Tungro incidence in nearest source 0.0400 0.0158 2.53 0.011 
Tungro-resistant variety —0.559 0.242 2.31 0.021 
Vector abundance 0.0603 0.0105 5.76 <0.001 
Vector abundance in nearest source —0.00912 0.00405 —2.25 0.025 


Resistant variety x vector interaction —0.0261 0.0117 —2.23 0.026 


All significant two-way interactions and main effects are included. Interactions with 
period cannot be included as liaised with other variables. Parameters for factors, Period 
and Variety, are differences compared with the reference levels, Period 2 and tungro- 
susceptible variety. 


a strong effect of Period with six dates contributing to the model when 
compared with the reference level, Period 2. 

In addition to the effects of Period, five other variables made signifi- 
cant contributions to explaining differences in tungro infection in the 
set of vulnerable fields. Two variables concerned the vulnerable field 
itself: the variety reaction to tungro and the abundance of leafhopper 
vectors. Not unexpectedly, higher tungro infection was associated with 
tungro-susceptible varieties (p < 0.02) and higher vector abundance 
(p < 0.001). The interaction between variety reaction and vector 
abundance was also significant, indicating that high-vector abundance 
had less effect when the variety concerned was resistant to tungro. 
This was the only interaction term that proved significant. A separate 
analysis showed that, for the set of vulnerable fields infected with 
tungro disease, in other words those in which incidence was >1% 
and could therefore be quantified, mean incidence was 6.17% in sus- 
ceptible varieties compared with 2.42% in resistant varieties (n = 100, 
p < 0.001). 

Three variables associated with tungro sources had a significant 
effect on tungro disease risk in vulnerable fields. The strongest effect 
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Fic 4. Distance from the vulnerable field to tungro sources are highly correlated. 


was that of distance. The approach taken in the analysis to investigate 
the effect of tungro sources was first to include the variables relating to 
the nearest source, then to add those for the next nearest source. To 
some extent sources were correlated, for example, if the nearest source 
is a long distance away the next-nearest source must be at least that 
far. In fact, the distance to the nearest source and the next-nearest 
source were highly correlated (Fig. 4), and it was the distance to next- 
nearest source that proved to be a slightly better predictor of tungro 
infection in vulnerable fields than the distance to the nearest source, 
the two terms being interchangeable in the model. Infected fields were, 
on an average, significantly closer to tungro sources than fields, which 
remained free of tungro disease (Fig. 5). 

An example of the spatial pattern of tungro is illustrated in Fig. 6, 
which shows the distribution of tungro incidence at two successive 
sampling dates. These dates correspond to Periods 11 and 12 in 
Fig. 2. The figure illustrates a period when tungro was increasing, 
and by comparison of the successive patterns it can be seen that new 
infections often occurred close to prior infections that act as sources. 
Infections that are present in the first map, but not the second, repre- 
sent those fields with crops that have become too mature to be disease 
sources or which have been harvested. 

Both tungro incidence in the nearest source and vector abundance in 
the nearest source made significant contributions to the model, in this 
case slightly better than the equivalent terms for the next-nearest 
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Fic 5. Average distances to sources of vulnerable fields falling in each tungro infection 
class. 


source. Higher levels of disease in sources led to an increased risk of 
infection in vulnerable fields, as might be expected. For vulnerable 
fields with zero tungro infection, mean tungro disease incidence in 
both nearest and next-nearest sources was 3% (Fig. 7). By contrast, 
for fields with at least 1% infection, mean disease incidence in nearest 
and next-nearest sources was 5% and 7%, respectively. 

The effect of vector abundance in sources on tungro risk in vulnera- 
ble fields is less easy to explain. The negative coefficient indicates that 
lesser number of vectors in the nearest source was associated with 
increased risk of infection. The effect was small being an order of 
magnitude less than the effect of vector numbers in the vulnerable 
field itself. Explained by the lesser differences between infection cate- 
gories compared with those in the vulnerable field, the term moderates 
the larger effect of vector differences between the categories of vulner- 
able field (Fig. 8). 


E. Discussions 


The survey was conducted over 2 years, which differed greatly in the 
pattern of rice planting and in tungro disease incidence. Continuous 
planting during the first year was associated with relatively high levels 
of tungro incidence, particularly in fields planted during the middle and 
later periods of the rainy season. By contrast, the planting pattern was 
more closely synchronized during the second year, especially during 
the rainy season. Farmers did not make a conscious decision to reduce 
the temporal variability of planting times during the second year. They 
were forced to plant in this way due to a shortage of water caused by 
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Fic 6. Maps of a section of the study area showing the change in infection pattern 
between two successive sampling periods. 
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Fic 7. Average disease incidence in tungro infection sources for each category of 
infection of vulnerable fields. 
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Fic 8. Average abundance of leaf hopper vectors in vulnerable fields and their nearest 
sources, for each infection category of the vulnerable field. 


repair work conducted on irrigation facilities during the months of 
April and May 1994. The 2-month break in planting significantly 
reduced the potential for tungro inoculum to be carried over from 
the later dry season plantings. This resulted in much lower tungro 
disease incidence than in the equivalent period during the previous 
year. The findings support results from elsewhere in the Philippines, 
which indicated that staggered planting contributes to increased 
tungro incidence (Cabunagan et al., 2001; Loevinsohn, 1984). In 
South Sulawesi, Indonesia, staggered planting was thought to be a 
major cause of tungro problems, and regulations were introduced to 
synchronize planting during periods when vector populations were 
relatively low (Sama et al., 1991). 


22 T. C. B. CHANCELLOR ET AL. 


The results of the regression analysis showed that the risk of tungro 
infection in a vulnerable field varied significantly at different times 
during the “high incidence” year that was examined. Early rice plant- 
ings tended to be largely free of tungro disease and the risk of infection 
increased toward the end of the dry season. As the proportion of infected 
fields increased, the mean distance to disease sources reduced 
thus increasing the potential risk to the declining number of healthy 
“vulnerable” fields. The risk of tungro infection was relatively high 
throughout most of the rainy season. Higher tungro incidence during 
the rainy season has been reported in several studies and is usually 
explained by a combination of greater leaf hopper abundance or activity 
and a higher proportion of available fields planted to rice (Thresh, 
1989). In rainy seasons, leafhopper generations generally develop quite 
rapidly with peak population densities occurring during the middle of 
the season. In dry seasons, population build-up is more gradual and 
peak densities are often reached when plants are at an advanced growth 
stage (Chancellor et al., 1996; Cook and Perfect, 1989). 

The temporal pattern in leaf hopper abundance was not related to that 
of disease during the period analyzed. However, vector abundance in the 
vulnerable field, though not in the nearest source, contributed signifi- 
cantly to the probability of new infection. The importance of overall 
number of vectors on the risk of tungro occurrence has been emphasized 
by Suzuki et al. (1997). Older crops tend to be more important sources of 
leafhoppers, whereas young crops may supply a greater proportion 
of infective leafhoppers due to the higher efficiency with which virus is 
recovered during feeding. Further, early season immigration of leafhop- 
pers tends to be greatest in areas where planting dates are highly 
variable and there are overlapping rice crops (Widiarta et al., 1990). 

The relationship between vector numbers and disease incidence is 
often complex but is largely dependent on the availability of inoculum 
sources. In a comparison of factors affecting tungro outbreaks in 
endemic and nonendemic areas in the Philippines, the occurrence of 
outbreaks in the nonendemic area was primarily determined by 
the appearance of viruliferous vectors. Overall leafhopper numbers 
were also important but the epidemics were essentially driven by the 
availability of sufficient inoculum (Savary et al., 1993). There were 
differences in the disease—vector relationships between the two endemic 
areas included in the study but outbreaks were less responsive to 
changes in inoculum. Presumably, if inoculum is not limiting, one of the 
basic requirements for a tungro epidemic has been fulfilled and vector 
abundance then has greater influence on disease development, although 
the outcome will depend on other factors such as the susceptibility of rice 
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varieties. Similar conclusions can be drawn from other studies conducted 
in Southeast Asia (Chancellor et al., 1996; Suzuki et al., 1992). 

Our results suggest that a strategy aimed at reducing leafhopper 
numbers in vulnerable fields is likely to reduce the risk of tungro occur- 
ring. Insecticides have been used in attempts to prevent tungro infec- 
tion from occurring (e.g., Thresh, 1989). However, insecticides are not 
always effective in controlling tungro, especially where leafhopper 
immigration pressure is high and may occur over a long period and 
where there is no coordination between farmers in a locality (Suzuki 
et al., 1997). Further, the threat posed to human health and the risk of 
initiating resurgence of other pests, principally the brown planthopper 
Nilaparvata lugens (Stal), are major constraints to insecticide use in 
rice (Holt et al., 1996). 

Rather than attempting to protect vulnerable fields, another ap- 
proach might be to target the use of insecticides on source fields in an 
attempt to reduce the potential for disease spread to neighboring rice 
crops. From the 1970s, it has been a common practice in the Philippines 
for the Department of Agriculture to organize “mass spray” campaigns 
against the insect vector in response to a tungro outbreak (Warburton 
et al., 1997). Although a more broad-based approach to tungro manage- 
ment has evolved in recent years, spraying campaigns are still con- 
ducted in endemic areas if tungro incidence in a particular locality is 
perceived to be great enough to pose a threat to neighboring rice areas. 
Both the affected area and a buffer zone around it are usually sprayed in 
order to try to prevent the spread of the disease, but it is not known 
whether this approach has been successful. As with the use of insecti- 
cides to protect vulnerable fields, the strategy has the disadvantage of 
posing a threat to human health and of creating the conditions for 
outbreaks of other secondary pests. 

The regression analysis showed that the risk of tungro infection in a 
vulnerable field was associated with tungro incidence in the nearest 
source. In “point” source experiments carried out in the Philippines, to 
examine the effect of sources within rice plantings, there were signifi- 
cant effects of source size on tungro incidence (Satapathy et al., 1997). 
The effects were less pronounced during a wet season trial when 
background levels of disease in the surrounding area were high. How- 
ever, in a dry season trial when disease incidence was relatively low, 
even the effect of a single-hill central source could be detected (p < 0.05). 
Mean disease incidence in the single-hill plots was 3.3%, compared 
with 0.6% in the plots with no source and 11.0% in plots with a central 
source of 25 infected hills. The data from our survey demonstrate that 
a source effect operates over a larger scale between rice fields. 
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The strongest effect among variables associated with tungro sources 
was shown by the variable that was most central to the objectives of the 
study—distance to inoculum sources. The best-fit regression model 
included distance to the “next-nearest” source, but this was only margin- 
ally better than distance to the nearest source and the two variables were 
strongly correlated. In general, the addition to the regression of variables 
associated with the next-nearest source added rather little to the model, 
which included only those for the nearest source. It can be argued, 
therefore, that the nearest source contributed most of the information 
about source, which was relevant to explaining new infections. As dis- 
cussed earlier, proximity effects in relation to infection sources have been 
demonstrated for Lettuce mosaic virus (Broadbent et al., 1951) and Beet 
mosaic virus (Watson et al., 1951), which are both nonpersistent aphid- 
borne viruses. Similarly, infection sources in neighboring fields and 
ditches were shown to have a strong influence on the risk of infection of 
tomato crops with a virus disease complex transmitted by a range 
of aerial vectors (Nelson et al., 1994). The relatively short retention 
period of tungro viruses in their leafhopper vectors, and the tendency 
for most of the leafhoppers to make short flights (Chancellor et al., 1997), 
probably explain the proximity effect observed in our study. 

The methodology utilized in our study has enabled the magnitude of 
the distance effect to be quantified and the results to be used to develop 
practical management recommendations for farmers. The results show 
that the average distance to an inoculum source for vulnerable rice 
fields which became infected was about half that for fields in which no 
disease developed. Very few vulnerable fields were infected with tun- 
gro when they were located at distances of over 600 m from an inocu- 
lum source. Figure 9 illustrates that the infection risk approximately 
halved for every additional 200 m distance increment to the nearest 
source. Of course, constrained by the size of the study area, the total 
number of fields was fewer in the higher distance categories, but even 
in the highest category (>1000 m) there were 20 fields, none of which 
were infected. Thus, where farmers have the option of planting a 
resistant variety they should do so if there is a source within a 
few hundred meters. As the risk of infection is significantly lower at 
greater distances, farmers may decide to accept this risk by planting 
a preferred susceptible variety. The risk of infection will also be 
influenced by the level of tungro disease in the source field, but this 
may not be so easy for farmers to determine. 

As expected, growing a resistant variety significantly reduced the 
risk of a vulnerable rice field from becoming infected. There was also a 
significant interaction of resistant variety and leaf hopper abundance, 
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Fic 9. Probability (%) of a vulnerable field becoming infected at different distances 
from the nearest tungro source. The total number of fields at each distance is also shown. 


indicating that the effect of vectors in increasing risk was less pro- 
nounced on resistant varieties. The varieties classified as resistant in 
the survey were resistant to N. virescens and this conferred “field 
resistance” to tungro, which has been shown to be effective in other 
locations in the Philippines (Cabunagan et al., 2001). In some areas of 
high-tungro incidence, resistance characteristics were found to be an 
important criterion for varietal selection (Warburton et al., 1997). 
However, other characteristics, such as aroma and grain quality, may 
be considered more important by farmers. As the adoption rate of 
resistant varieties is likely to be influenced by these considerations, 
it may be more appropriate to target the deployment of resistant 
varieties to the most vulnerable fields and to later plantings, which 
carry a higher risk. Focusing on later plantings would have the added 
benefit of reducing the amount of inoculum that can be carried over 
into the next cropping season. 

In the case of the lettuce, sugar beet, and tomato crops mentioned 
earlier, virus disease management recommendations included greater 
spatial or temporal separation between fields or blocks of fields to 
reduce the risk of primary infection. Spatial separation is clearly not 
possible in an irrigated lowland rice area where rice is the dominant 
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crop and is grown throughout the year. Temporal separation may be 
achieved in areas where irrigation water is scheduled and there is 
evidence to suggest that the imposition of a fallow period between rice 
crops has contributed to a reduction in tungro incidence in some 
localities (Sama et al., 1991; Taib, 1987). The results of our study 
indicate that a break between crops can reduce carryover of disease 
into the next season. However, modeling work suggests that there is a 
critical relationship between planting date variance and the length of 
the fallow period needed to reduce carryover of disease (Holt and 
Chancellor, 1997). Model outputs showed that persistence of disease 
within a matrix of fields was determined primarily by planting date 
variance. There was a threshold of planting date variance where a 
relatively small reduction in variance led to a large reduction in dis- 
ease incidence. However, where planting date variance was relatively 
high the response to changes in variance was very small, and disease 
persisted in the system. Thus, attempts to reduce tungro incidence by 
manipulating planting dates are more likely to be effective in areas 
with moderate rather than high levels of cropping asynchrony. 

Most research on the spread of virus diseases has concentrated on 
patterns of disease spread within fields. Such experiments can deter- 
mine whether primary spread is taking place and can quantify patterns 
of any secondary spread that occurs. Different approaches are needed to 
answer questions about the spread of virus disease between fields. 
Surveys of the kind we carried out in Polangui are costly in time and 
resources. However, they do provide a means of examining the factors 
that affect risk of primary infection by a virus disease in space and time. 
The occurrence of new infections can be explicitly related to the occur- 
rence of inoculum sources and, as we have shown for rice tungro disease, 
it may be possible to detect and quantify a proximity effect. 
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ABSTRACT 


Poxviruses comprise a large family of viruses characterized by a large, 
linear dsDNA genome, a cytoplasmic site of replication and a complex 
virion morphology. The most notorious member of the poxvirus family is 
variola, the causative agent of smallpox. The laboratory prototype virus 
used for the study of poxviruses is vaccinia, the virus that was used as a 
live, naturally attenuated vaccine for the eradication of smallpox. 

Both the morphogenesis and structure of poxvirus virions are unique 
among viruses. Poxvirus virions apparently lack any of the symmetry 
features common to other viruses such as helical or icosahedral capsids 
or nucleocapsids. Instead poxvirus virions appear as “brick shaped” or 
“ovoid” membrane-bound particles with a complex internal structure 
featuring a walled, biconcave core flanked by “lateral bodies.” The virion 
assembly pathway involves a remarkable fabrication of membrane- 
containing crescents and immature virions, which evolve into mature 
virions in a process that is unparalleled in virology. As a result of 
significant advances in poxvirus genetics and molecular biology during 
the past 15 years, we can now positively identify over 70 specific 
gene products contained in poxvirus virions, and we can describe the 
effects of mutations in over 50 specific genes on poxvirus assembly. This 
review summarizes these advances and attempts to assemble them into 
a comprehensible and thoughtful picture of poxvirus structure and 
assembly. 


I. PREFACE 


Both the morphogenesis and structure of poxvirus virions are unique 
among viruses. Poxvirus virions apparently lack any of the symmetry 
features common to other viruses such as helical or icosahedral capsids 
or nucleocapsids. Instead poxvirus virions appear as “brick shaped” or 
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“ovoid” membrane-bound particles with a complex internal structure 
featuring a walled, biconcave core flanked by “lateral bodies.” The virion 
assembly pathway involves a remarkable fabrication of membrane- 
containing crescents and immature virions, which evolve into mature 
virions in a process that is unparalleled in virology. 

Poxvirus structure and assembly has been the subject of intense 
scrutiny since the virus was first subjected to detailed examination 
by electron microscopy in 1950s and 1960s, yet the most recent com- 
prehensive reviews of the subject are at least 15 years old (Dales and 
Pogo, 1981; Fenner et al., 1989; see also Moss, 2001; Sodeik and 
Krijnse-Locker, 2002). During these 15 years, key advances in poxvir- 
ology have brought us to a point where a comprehensive review of 
virus structure and assembly is appropriate. First, advances in ge- 
nome and protein sequencing now readily permit the unambiguous 
assignment of protein products to specific genes on the poxvirus ge- 
nome (Chung et al., 2006; Goebel et al., 1990; Jensen et al., 1996; 
Takahashi et al., 1994). Second, advances in poxvirus genetics have 
yielded both well-characterized collections of temperature-sensitive 
mutants and techniques for targeted construction of conditional lethal 
mutants with alterations in specific poxvirus genes (Condit and Niles, 
1990; Hassett and Condit, 1994; Lackner et al., 2003; Rodriguez and 
Smith, 1990b; Zhang and Moss, 1991a). As a result of these advances, 
we can now positively identify over 70 specific gene products contained 
in poxvirus virions, and we can describe the effects of mutations in 
over 50 specific genes on poxvirus assembly. This review summarizes 
these advances and attempts to assemble them into a comprehensible 
and thoughtful picture of poxvirus structure and assembly. 


II. INTRODUCTION 


A. Poxvirus Biology and Replication: An Overview 


Poxviruses comprise a large family of viruses that infect a wide 
variety of vertebrate and invertebrate hosts. Members of the family 
Poxviridae are characterized by a large, linear dsDNA genome, a 
cytoplasmic site of replication and a complex virion morphology. The 
most notorious member of the poxvirus family is variola, the causative 
agent of smallpox. The laboratory prototype virus used for the study of 
poxviruses is vaccinia, the virus that was used as a live, naturally 
attenuated vaccine for the eradication of smallpox. Because the vast 
majority of genetic and biochemical studies on poxvirus structure and 
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morphogenesis have been performed with vaccinia, this review will 
confine itself exclusively to this virus. Members of the poxvirus family 
are sufficiently similar such that lessons learned from vaccinia can 
easily be applied to other poxviruses. 

The vaccinia virus genome is approximately 200 kb in length and 
encodes approximately 200 genes (there is some strain variation in 
DNA size and coding capacity) (Goebel et al., 1990; Johnson et al., 
1993; Lefkowitz et al., 2005). Because of the cytoplasmic site of virus 
replication, vaccinia viral mRNAs are not spliced, and therefore vac- 
cinia genes do not contain introns, greatly simplifying interpretation of 
genomic sequence. Genes are closely spaced on the genome, and each 
gene appears to be controlled by its own transcriptional promoter. 
Comparison of poxvirus genomes reveals a set of 91 genes that are 
conserved throughout the Chordopoxvirinae subfamily (vertebrate 
poxviruses); a subset of 49 of these genes are conserved throughout 
the entire Poxviridae family (including both vertebrate and inver- 
tebrate poxviruses) (Upton et al., 2003). The remaining nonconserved 
genes, many of which are non essential for replication in cell 
culture, dictate individual virus characteristics of host range and 
pathogenicity. 

Poxvirus virions exist in three infectious forms: mature virions 
(MV), wrapped virions (WV), and extracellular virions (EV).1, MV, 
the simplest form of the virus, are membraned particles containing a 
biconcave, DNA-containing core flanked by lateral bodies, which fill 
the concavities of the core. MV are normally found exclusively inside 
cells and are liberated only by cell lysis. WV consist of MV which are 
surrounded by two additional lipid bilayers derived from trans-Golgi 
cisternae. WV, whose outer membranes contain characteristic viral 
proteins, are precursors of EV and are also found within the cell. EV 
consist of WV which have been exocytosed via fusion of the outermost 
WV membrane with the plasma membrane, leaving an MV wrapped in 
one additional membrane. A fraction of EV are found attached to the 
cell surface, while some are found free in the extracellular medium. EV 
are thought to be important for spread of the virus within an organism, 


1 This nomenclature follows a recent proposal by Moss (2005). Previously, MV was 
called intracellular mature virus (IMV), WV was called intracellular enveloped virus 
(IEV), and EV included both cell associated extracellular enveloped virus (CEV) and 
extracellular enveloped virus (EEV). In older literature, MV was called “intracellular 
naked virus” (INV) (Sodeik et al., 1993), thus MV is identical to both IMV and INV. 
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while MV are thought to be important for long-term stability and 
transmission of the virus between hosts in the environment. It should 
be noted, however, that the EV membrane is disrupted when EV 
bind to cells, enabling MV to make direct contact with the plasma 
membrane and enter cells (Law, M., Carter, G. C., Roberts, K. L., 
Hollinshead, M., and Smith, G. L., personal communication). The 
extra membrane composition, assembly, and transport of WV and EV 
have been reviewed (Smith et al., 2002); therefore, this review will 
confine itself exclusively to the structure and assembly of MV. 

MV enter cells by fusion of the MV membrane with the plasma 
membrane of the host cell, releasing the core (and lateral bodies) 
into the cytoplasm and activating the virus’ transcriptional program 
(Armstrong et al., 1973; Carter et al., 2005). The virion cores contain 
the full complement of virus-coded enzymes required for synthesis and 
modification of early mRNA, including a nine subunit RNA polymer- 
ase, a virus early transcription factor (VETF), a capping enzyme, and a 
poly(A) polymerase. Early genes encode enzymes required for DNA 
replication, and thus as early gene expression peaks, viral DNA repli- 
cation ensues in cytoplasmic sites termed “factories.” Early genes 
also encode intermediate transcription factors, and intermediate 
genes, in turn, encode late transcription factors, so that intermediate 
and late genes are expressed in succession after the prerequisite initi- 
ation of viral DNA replication. Thus, the full complement of viral genes 
are transcribed in a temporal cascade, with the early, intermediate 
and late classes being distinguished by class-specific transcriptional 
promoters and virally encoded transcription factors. Furthermore, 
only replicated genomes are competent templates for intermediate 
and late transcription. These two classes of genes together encode 
virion structural proteins, virion enzymes, and assembly factors re- 
quired for assembly of new virus particles (for a review of poxvirus 
replication, see Moss, 2001). 


B. Genetic Nomenclature 


The first complete poxvirus genome sequence to be published was 
that of the Copenhagen strain of vaccinia virus (Goebel et al., 1990). 
With the publication of this sequence, a genetic nomenclature was 
adopted in which genes were named according to their position relative 
to the left end of individual Hind III restriction fragments of the 
genome and also according to their transcriptional orientation. Thus, 
for example, the virus DNA polymerase is encoded by gene E9L, the 
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ninth gene from the left end of the Hind III E fragment, and the gene is 
transcribed in a leftward direction relative to the standard orientation 
of the genome. 

While Copenhagen was the first vaccinia strain sequenced, most of 
the genetic and biochemical analysis of vaccinia has been performed 
using the WR strain, a strain that is not identical in genetic content to 
Copenhagen. Initially, the Copenhagen system of nomenclature 
was applied to WR; however, strain variation, sequencing glitches, 
and the absence of complete genome sequence made this usage occa- 
sionally confusing. For example, the vaccinia strain WR genes initially 
dubbed A5L and A8L are orthologous to the Copenhagen genes A4L 
and A7L, while further to the right in the Hind III A fragment, 
WR genes orthologous to Copenhagen genes were given the pre- 
cise Copenhagen name. Despite this initial confusion, over the years 
the Copenhagen nomenclature has become the most familiar among 
poxvirologists, and a custom has emerged which is to refer to genes in 
WR using the orthologous Copenhagen gene identification. According- 
ly, the WR genes previously called A5L and ASL are now referred to as 
A4L and A7L. 

More recently, faced with an explosion of poxvirus genome sequences 
available and the impossibility of maintaining the Copenhagen no- 
menclature across such a diversity of sequences, a new nomenclature 
has emerged in which genes are simply numbered in sequence from 
left to right on the genome, along with an identifier specifying the 
virus strain. Thus while the E9L gene in Copenhagen is now officially 
“VV-Cop-E9L,” in WR it is “VACWRO65.” 

This review deals almost exclusively with experiments done with the 
WR strain of vaccinia. For the sake of clarity, we have maintained 
the custom of referring to WR genes using the Hind III designation 
of the Copenhagen ortholog. Following established convention, genes 
are referred to using the entire designation, for example, “E9L” for the 
DNA polymerase gene, while proteins are specified by omitting the “L” 
or “R,” for example, “E9” for the DNA polymerase enzyme. For the sake 
of precision, in Table I we have included a column specifying the 
official WR gene name for each protein. 


C. Poxvirus Genetics: Special Considerations 


Conditional lethal mutants have been isolated which encode altera- 
tions in over half of the essential genes in vaccinia virus. These mu- 
tants are of two different types, inducible and temperature sensitive. 
The two different types of mutants function by fundamentally different 
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TABLE I 


VACCINIA VIRION PROTEINS 





Gene* WR designation Mw Location Mutants* Comment? 

D13L VACWR118 62 Absent ind, ts Rifampicin target, IV scaffold protein 

AlIR VACWR130 36 Absent ind Hydrophobic, phosphorylated (independent 
of F10), interacts with A32, itself 

A2.5L VACWR121 9 Virion enzyme ind Thiol oxidoreductase 

A82L VACWR155 31 Virion ind Putative ATPase, DNA encapsidation 

A17L VACWR137 23 Membrane ind TM(2), F10-dependent phosphorylation, 
cleaved, complex with A14, A27 

Al4L VACWR133 10 Membrane ind TM(2), F10-dependent phosphorylation, 
dimerizes, complex with A17, A27 

A13L VACWR132 8 Membrane ind, ts TM(1), phosphorylated (independent of F10), 
DNA encapsidation 

ASL VACWR128 12 Membrane ind TM(1) 

LIR VACWR088 27 Membrane ind TM(1), myristylated, redox substrate 

H3L VACWR101 37 Membrane ind, ko TM(1), heparin binding 

A21L VACWR140 14 Membrane ind TM(1), redox substrate, fusion complex 

L5R VACWRO092 15 Membrane ind TM(1), redox substrate, fusion complex 

A28L VACWRI151 16 Membrane ind, ts TM(1), redox substrate, fusion complex 

H2R VACWR100 22 Membrane ind TM(1), redox substrate, fusion complex 

G3L VACWRO79 13 Membrane ts TM(1), fusion complex 

GOR VACWRO087 39 Membrane ind TM(1), myristylated, fusion complex 

J5L VACWR097 15 Membrane TM(1), myristylated, fusion complex 

A16L VACWR136 43 Membrane ind TM(1), myristylated, fusion complex 

I2L VACWRO71 8 Membrane ind TM(1) 

D8L VACWR113 35 Membrane ko TM(1), chondroitin binding 











(continues) 
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TABLE I (continued) 








Gene* WR designation Mwt Location Mutants* Comment? 

A27L VACWR150 13 Membrane ind, ko No TM, complex with A17, A14, heparin 
binding 

A26L VACWR149 58 Membrane ko p4c, TM(?) 

A14.5L VACWR134 6 Membrane ko TM(2) 

I5L VACWRO074 9 Membrane TM(2) 

E10R VACWRO66 11 Membrane ind Thiol oxidoreductase, no TM 

enzyme 
G4L VACWRO81 14 Membrane ind Thiol oxidoreductase, no TM 
enzyme 

HbR VACWR103 22 Core ts F10, B1-dependent phosphorylation, roles in 
DNA replication, transcription, 
morphogenesis 

G5R VACWRO082 50 Core ind 

A30L VACWR153 9 Core ind, ts 7 complex, F10-dependent 
phosphorylation, partially NP40 
extractable, in viroplasm, IV’s 

G7L VACWRO085 42 Core ind, ts 7 complex, F10-dependent phosphorylation, 
cleaved 

JIR VACWR093 18 Core ind, ts 7 complex, partially NP40 extractable 

AI5L VACWR135 11 Core ind 7 complex 

D2L VACWR107 17 Core ind, ts 7 complex 

DƏR VACWR108 28 Core ts 7 complex 

A10L VACWR129 102 Core ind, ts p4a/4a, cleaved, complex A4, core wall? 

I6L VACWR075 43 Core ts Telomere binding, DNA encapsidation 

IIL VACWR070 36 Core ind DNA binding 

F17R VACWRO056 11 Core ind H1 dependent, partially F10 dependent 


phosphorylation, DNA binding 


6€ 


A4L 


A3L 
L4R 
E8R 
E11L 
L3L 
Al2L 
J6R 
A24R 
H4L 


A29L 
E4L 
J4R 
A5R 
D7R 
G5.5R 
A7L 


D6R 


DIR 


D12L 


VACWR123 


VACWR122 
VACWRO91 
VACWRO64 
VACWRO067 
VACWRO090 
VACWR131 
VACWRO98 
VACWR144 
VACWR102 


VACWRI152 
VACWRO60 
VACWRO96 
VACWR124 
VACWR112 
VACWR083 
VACWR126 


VACWR111 


VACWR106 


VACWR117 


31 


73 
28 
32 
15 
41 
20 
147 
133 
94 


35 
30 
21 
19 
18 


82 


74 


97 


33 


Core 


Core 
Core 
Core 
Core 
Core 
Core 
Core enzyme 
Core enzyme 
Core enzyme 


Core enzyme 
Core enzyme 
Core enzyme 
Core enzyme 
Core enzyme 
Core enzyme 
Core enzyme 


Core enzyme 


Core enzyme 


Core enzyme 


ind 


ts 
ind 
ts 
ts 
ind 
ts 


ts 


ind, ts 


ts 


ts 


ts 


ind 


ind, ts 


ts 


ts 


Core wall spike protein(?), membrane 
associated(?), F10, H1-dependent 
phosphorylation, complex with A10 


p4b/4b, cleaved, core wall 
VP8, cleaved 
TM(2), phosphorylated 


Cleaved 
RNA polymerase subunit, rpo147 
RNA polymerase subunit, rpo132 


RNA polymerase subunit, rap94, early gene 
specificity factor, required for early 
transcription termination 


RNA polymerase subunit, rpo35 

RNA polymerase subunit, rpo30 

RNA polymerase subunit, rpo22 

RNA polymerase subunit, rpo19 

RNA polymerase subunit, rpo18 

RNA polymerase subunit, rpo7 

Early gene transcription initiation 
factor, VETF 

Early gene transcription initiation 
factor, VETF 

Capping enzyme, also early gene 
termination factor, intermediate gene 
initiation factor 

Capping enzyme, also early gene 
termination factor, intermediate gene 
initiation factor 
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TABLE I (continued) 








Gene* WR designation Mw Location Mutants* Comment’ 

EIL VACWR057 55 Core enzyme Poly(A) polymerase catalytic subunit 

J3R VACWR095 39 Core enzyme ko Poly(A) polymerase elongation subunit, also 
2' 0 methyltransferase, transcription 
elongation factor 

D11L VACWR116 72 Core enzyme ts Early gene termination factor, NPH I 

I8R VACWRO077 78 Core enzyme ts RNA helicase 

A18R VACWR138 57 Core enzyme ts Postreplicative transcription termination 
factor, RNA release factor 

A22R VACWR142 22 Core enzyme ind Holliday resolvase, concatemer resolution, 
palmitylprotein 

H6R VACWR104 37 Core enzyme ko Topoisomerase 

K4L VACWR035 49 Core enzyme ko DNA nicking and joining enzyme, 
nonessential 

GIL VACWR078 68 Core enzyme ind Metalloproteinase(?) 

I7L VACWRO76 49 Core enzyme ind, ts Cysteine proteinase(?), virion protein 
processing 

FI10L VACWR049 52 Core enzyme ts, ind Ser/Thr/Tyr kinase, 7 complex 

BIR VACWR183 34 Core enzyme ts Ser/Thr kinase, essential for DNA 
replication 

HIL VACWR099 20 Core enzyme ind Tyr/Ser/Thr phosphatase 

O2L VACWR069 12 Core enzyme ko Glutaredoxin, nonessential, not conserved 

A45R VACWR171 14 Core enzyme ko Inactive superoxide dismutase, nonessential 


” Copenhagen designation. 
t predicted, apparent MW may differ. 
+ ind, inducible; ts, temperature sensitive; ko, knockout. 


$ TM, transmembrane domain, number of domains in parentheses. 
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mechanisms; an understanding of these mechanisms is important for 
proper evaluation of mutant phenotypes. 

In inducible mutants, the gene in question is engineered such that 
its expression is controlled by the regulatory elements of a bacterial 
operon, commonly either lac or tet (Rodriguez and Smith, 1990a; 
Traktman et al., 2000; Zhang and Moss, 1991a). Thus, in the presence 
of inducer, the wild-type gene and protein in question are expressed, a 
permissive condition exists, and virus can be grown for study. In the 
absence of inducer, the gene is not expressed, a nonpermissive condi- 
tion exists, and the mutant phenotype can be studied. Importantly, 
under nonpermissive conditions in an inducible mutant, the gene 
product in question is absent or present at only low levels. 

In temperature-sensitive mutants, the gene in question contains a 
missense mutation that renders the protein product either unstable or 
nonfunctional at high temperature (Condit and Niles, 1990). Thus, 
virus is grown for study at low temperature, the permissive condition, 
and the mutant phenotype studied using infections at high tempera- 
ture, the nonpermissive condition. In mutants where the protein in 
question is unstable at high temperature, the protein is effectively 
absent from the infection and thus mechanistically the mutant is 
similar to an inducible mutant. An important caveat to this statement, 
however, is that any residual protein is mutant in nature, whereas the 
residual protein present in the case of inducible mutants is wild type. 
For temperature-sensitive mutants in which the target protein is non- 
functional at the high temperature but nevertheless physically pres- 
ent, the mechanism of temperature sensitivity is significantly different 
than in an inducible mutant. Most temperature-sensitive mutants 
extant have been isolated by classical mutagenesis and random screen- 
ing methods;”, however, methods exist whereby temperature-sensitive 
mutants can be engineered into specific genes in a targeted fashion 
(DeMasi and Traktman, 2000; Grubisha and Traktman, 2003; Hassett 
and Condit, 1994; Hassett et al., 1997; Ishii and Moss, 2001; Punjabi 
et al., 2001). 


2 Temperature-sensitive mutants are referred to using either a mutant-specific desig- 
nation or a generic protein-specific designation. The mutant-specific designation consists 
of a letter, either C, D, or E, signifying the original mutant collection, (Condit, Dales, or 
Ensinger) followed by “ts” and a number specific to the mutant, for example, Dts46 
(Lackner et al., 2003). The generic, protein-specific notation consists of “ts” followed by 
the name of the protein, for example, tsA30. 
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Each mutant type has advantages and disadvantages. Advantages 
of inducible mutants include the relative ease with which a mutant 
may reliably be targeted to a specific gene and the fact that all 
experiments can be done at one temperature, thus avoiding the 
possibility of temperature-induced abnormalities in the infection. 
Disadvantages include the inability to use this system effectively 
for viral genes expressed early in infection and the inability to en- 
gender a tight phenotype for some late genes, due to some intrinsic 
leakiness in the system. Advantages of temperature-sensitive mu- 
tants include the ability sometimes to perform temperature shift-up 
and shift-down experiments to test for the requirement for a gene at 
different times during the life cycle and the existence of some mu- 
tants which synthesize stable but nonfunctional protein at the non- 
permissive temperature. Disadvantages of temperature-sensitive 
mutants include the difficulty of isolating mutants in specific genes 
in a targeted fashion and the necessity of performing experiments at 
high temperature, which may induce mutant-independent abnormal- 
ities in the infection. In the end, the two types of mutants have 
complementary advantages and provide powerful tools for the study 
of gene function. 


III. Vaccinia VIRION STRUCTURE 


A. Imaging Studies of MV Structure 


Electron microscopic investigation of vaccinia virus structure has 
involved analysis of both whole mount preparations of purified virus 
and thin sections of purified virions or virus-infected cells. Each ap- 
proach has strengths and limitations that are important to consider 
in deriving an overall model for the structure of the virus. In overall 
dimensions, the virion is highly asymmetric, described variously as 
ellipsoidal, brick shaped, or barrel shaped (see the following para- 
graph). As a result of this asymmetry, in most whole mount pre- 
parations of virus, the majority of particles appear to be oriented in a 
nonrandom fashion, presenting the broadest surface of the virion per- 
pendicular to the electron beam, revealing clearly only the two widest 
dimensions. Whole mount preparations also provide detail relevant 
to the surface of the particle but usually reveal less information 
about the internal structure. By contrast, thin sections present all 
possible orientations and sections of the virion for viewing; however, 
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reconstructing an accurate three-dimensional (3D) view of a relatively 
complex and asymmetric structure can be daunting. While thin sec- 
tions provide minimal information about the surface of the particle, 
they reveal significant information about internal structure. 

Three studies provide estimates of the external dimensions of 
the virion that are consistent with historical measurements (Dales 
and Pogo, 1981; Fenner et al., 1989) and at the same time reflect 
some continuing uncertainty regarding the ultimate shape of the viri- 
on. Measurements of cryo-electron micrographs by Griffiths and co- 
workers suggest a brick- or pillow-shaped particle, with dimensions 
of 310 x 240 x 140 nm (Griffiths et al., 2001b; Roos et al., 1996; Sodeik 
and Krijnse-Locker, 2002). A study using atomic force microscopy 
suggests a slightly flattened ellipsoid with dimensions of 350 x 300 x 
265 nm (Malkin et al., 2003). Lastly, recent cryo-electron tomography 
of the virus provides evidence that MV are barrel shaped, with dimen- 
sions of 360 x 270 x 250 nm (Cyrklaff et al., 2005). Malkin et al. (2003) 
note that if particles are air dried and examined by atomic force 
microscopy, they shrink more than twofold along the shortest axis 
but only slightly along the longer two axes, yielding final dimens- 
ions of approximately 330 x 260 x 125 nm, provocatively similar to 
the measurements reported by Griffiths and coworkers using cryo- 
electron microscopy (Griffiths et al., 2001b; Roos et al., 1996). In sum- 
mary all three of these studies, using different approaches, suggest a 
flattened ellipsoid or barrel but leave some lingering uncertainty 
concerning the ratio of the length of the shortest axis to the longer 
axes. 

The surface of purified MV particles, as viewed in whole mount 
preparations, assumes two fundamentally different appearances de- 
pending on the integrity of the particles and the method of prepara- 
tion. Particles observed using negative staining, metal shadowing, 
freeze etch or deep etch electron microscopy, or atomic force microscopy 
possess randomly arranged surface ridges called “surface tubule 
elements” (STEs) (Fig. 1A-E) (Dales, 1962; Heuser, 2005; Malkin 
et al., 2003; Medzon and Bauer, 1970; Noyes, 1962a,b; Stern and Dales, 
1976; Westwood et al., 1964; Wilton et al., 1995). This form of virus 
was named the “M” form by Westwood et al. (1964) for its mulberry 
like appearance. Most negatively stained, purified preparations of MV 
also contain virions which lack obvious STEs but instead possess a 
clearly delineated boundary surrounding the particle (Fig. 1A). These 
latter forms were dubbed the “C” form by Westwood et al. (1964), 
because the boundary looks like a capsule surrounding the particle. 
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Visualization of purified MV in whole mount using cryo-electron 
microscopy reveals particles that closely resemble the C form of virus, 
that is, lacking evidence of STEs and possessing a boundary layer 
or “surface domain” (Fig. 1F) (Dubochet et al., 1994; Griffiths et al., 
2001b). In negatively stained preparations, treatments that are 
likely to disturb the integrity of the particle convert the M form of 
the virus to the C form (Dubochet et al., 1994; Westwood et al., 1964). 
Dubochet et al. (1994) interpreted these results to mean that the STEs 
are preparation artifacts induced by osmotic stress imposed during 





Fic 1. Vaccinia virus surface features. (A) Electron micrograph of purified virus, 
whole mount, negative stain. “Mulberry” (M) and “capsule” (C) forms are shown 
(Westwood et al., 1964). (B) Electron micrograph of a purified virion, whole mount, 
negative stain (Wilton et al., 1995). (C) Freeze etch electron micrograph of a purified 
virion (Nermut, 1973). (D) Deep etch electron micrograph of a purified virion (Heuser, 
2005). (E) Atomic force micrograph of a purified virion (Malkin et al., 2003). (F) Cryo- 
electron micrograph of a whole mount preparation of purified virions (Griffiths et al., 
2001b). Arrows denote the limits of the surface domain; arrowhead denotes the 
palisade layer. (A) Reprinted from Westwood et al. (1964) with permission. (B) Rep- 
rinted from Wilton et al. (1995) with permission. (C) Reprinted from Nermut (1973) 
with permission. (D) Reprinted from Heuser (2005) with permission. (E) Reprinted 
from Malkin et al. (2003) with permission. (F) Reprinted from Griffiths et al. (2001b) 
with permission. 
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negative staining and that the smooth surfaced C form of virus 
revealed by cryo-electron microscopy most accurately represents the 
surface of the virus. However, the persistence of STEs in metal 
shadowing, freeze etch and deep etch electron microscopy, and atomic 
force microscopy (Heuser, 2005; Malkin et al., 2003; Medzon and 
Bauer, 1970), plus the biochemical evidence for a structure resem- 
bling STEs (Wilton et al., 1995) (Sections III.C and IX.E.3) suggests 
that STEs are not artifactual. As originally proposed by Westwood 
et al. (1964), the C form of virus present in negatively stained pre- 
parations most likely results from penetration of stain into damaged 
virions, which destroys the contrast that ordinarily highlights STEs. 
The C form appearance of particles viewed by cryo-electron microsco- 
py probably results from an insensitivity of cryo-electron microscopy 
to fine surface detail (Malkin et al., 2003). 

Insights into the internal architecture of MV have been gained 
primarily through the use of thin sections, although some information 
can be obtained from whole mount preparations. Thin sections reveal 
two distinct boundaries, the membrane,’, which surrounds the entire 
MV particle, and the core wall, which surrounds an internal core. 
Cryosections, such as that shown in Fig. 2A, offer the clearest defini- 
tion of the substructure of the membrane and the core wall, each 
of which comprises two layers or domains. Estimates of the thicknesses 
of each of these domains vary, and the numbers bear significance to 
the biochemical makeup of each domain (discussed in Section VI.A.3) 
(Cyrklaff et al., 2005; Dubochet et al., 1994; Fenner et al., 1989; 
Hollinshead et al., 1999; Ichihashi et al., 1984). The outer domain 
of the membrane (outermost dark layer in Fig. 2A) is between 3 and 
9 nm thick, the inner domain (light layer underneath the outer domain 
and surrounding the entire particle in Fig. 2A) 5-6 nm thick, and 
estimates of the thickness of the membrane in aggregate range from 
10 to 20 nm. The outer core wall (dark layer surrounding the core 
in Fig. 2A) is clearly thicker than any of the four domains comprising 


3 Dales and coworkers proposed the term “envelope” to describe the MV outer bound- 
ary, based on its lipid content and its function analogous to envelopes of other viruses 
(Dales and Pogo, 1981). However, the common usage of the term envelope in referring to 
WV and EV (heretofore called IEV, CEV, and EEV) invites confusion over the origin, 
structure, and function of the MV outer boundary. We use the term “membrane” to 
describe the MV outer boundary; however, it is critical to understand that this two- 
domain structure is clearly significantly more complex than a simple single lipid mem- 
brane bilayer. Details of the substructure, lipid and protein composition of the MV 
membrane are provided in subsequent sections. 
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Fic 2. Vaccinia virus internal features. (A) Cryosection (Hollinshead et al., 1999). (B) 
Whole mount cryo-electron micrograph using uranyl acetate staining (Griffiths et al., 
2001b). The authors interpreted images of isolated particles as revealing significant 
asymmetry in the virion structure, two lipid bilayer membranes, and infoldings of the 
membranes (Section VI, Fig. 8, Model 4). Hence, the “R” indicates the right hand side of 
the particle. The authors state that “The particle has attached to the grid support in a 
manner such that a classical side view is evident. The two membrane layers are indicated 
with large arrows, and the spikes are indicated by an arrowhead. The star indicates the 
projection of one of the lateral bodies that we believe is due to the overlapping of the 
peripheral lobes of the virus (top and bottom). The small arrow indicates an inward 
groove in the membrane.” The image is one of the best available showing the internal 
structure of the virion in a whole mount preparation of virus. (C, F, and I) Three different 
projections of a whole mount preparation of virus, phosphotungstic acid stained (Peters 
and Mueller, 1963). (D, G, and J) Sections of purified virus, uranyl acetate stained, in 
three different perpendicular planes (Peters and Mueller, 1963). (E, H, and K) Sections of 
MV in three different perpendicular planes (Kato et al., 2004; Condit, unpublished). (E) 
and (H) are virions in infected cells; (K) was from a preparation of purified virus. (A) 
Reprinted from Hollinshead et al. (1999) with permission. (B) Reprinted from Griffiths 
et al. (2001b) with permission. (C, D, F, G, I, and J) Reprinted from Peters and Mueller 
(1963) with permission. (K) Reprinted from Kato et al. (2004) with permission. 


the membrane and the core wall; estimates of the width of the outer core 
wall range from 8 to 17 nm. The outer core wall has a striated appear- 
ance and therefore has been called the “palisade layer.” The inner core 
wall or “smooth layer” (innermost light layer in Fig. 2A), has been 
estimated at 5-8 nm in thickness. While biochemical evidence suggests 
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that the layers of the core wall are in fact biochemically distinct, the 
layered appearance of the membrane may simply reflect a structural 
asymmetry within an otherwise unitary structure. Over much of its 
surface, the outer core wall appears to be closely apposed to the inner 
domain of the membrane, as is clearly seen at the top and bottom of 
the particle in Fig. 2A. Appropriate sections of the particle reveal 
the central bilateral concavities in the core, formed by a separation 
of the core wall from the membrane. Filling these concavities between 
the core wall and the membrane are two apparently amorphous masses 
called “lateral bodies,” as shown clearly in the particles in Fig. 2A and K. 

Importantly, different sections through the virion yield very different 
impressions of the relationship between the core and the membrane. 
Figure 2C—K shows sections in all three possible spatial planes. Sections 
through the broadest plane (Fig. 2C—E) closely resemble whole mount 
cryo-EM images of purified virus or the C form of virus viewed by nega- 
tive staining. Thus, the capsule of the C form and the surface domain of 
the cryo-EM images most likely represent the four domains of the mem- 
brane and the core wall, tightly apposed with no separation between the 
core wall and the membrane and no evidence of lateral bodies. While 
most whole mount preparations of virus seem to be nonrandomly orient- 
ed in this fashion with the broadest plane perpendicular to the electron 
beam, particles are occasionally oriented on their sides, revealing 
the biconcave shape of the core and hinting at lateral bodies (Fig. 2B) 
(Griffiths et al., 2001b). Sections through the other two planes perpendic- 
ular to the broadest plane (Fig. 2F—K) often reveal the lateral bodies but 
may not if they are cut close to the ends or the sides of the particle. 

The core contains a tube-like structure first detailed by Peters and 
Mueller (1963) (Fig. 2C—K). Images that clearly reveal the length of 
the tube are rare, possibly because this structure is relatively labile 
(Peters and Mueller, 1963) and also because sections must be cut 
precisely parallel to, and must bisect, the broadest plane of the virus 
in order to clearly visualize the tube. Images of the tube in cross 
section are more common, although still relatively rare (Fig. 2F—H) 
(Heuser, 2005). 


B. Chemical Composition of MV 


Whole MV particles contain approximately 3.2% (of dry weight) 
DNA and 5% lipid, the balance of the mass presumably being composed 
of protein (Zwartouw, 1964). There is general agreement that the 
particle does not contain significant amounts of RNA. In addition, 
the presence of spermine and spermidine in MV has been reported; 
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as described in Section VIII, these polyamines may play a role in 
condensation of the viral DNA (Lanzer and Holowczak, 1975). 

The lipid composition of MV is generally similar to that of the host 
cell, with two significant differences. First, the fraction of phosphatidyl 
ethanolamine is about one-third less than the host cell and second, MV 
lipid contains approximately 25% of a biphosphatidic acid analog, acyl 
bis(monoacylglycero)phosphate, three to four times the amount found 
in host cells (Hiller et al., 1981; Stern and Dales, 1974). These differ- 
ences carry implications for the origin of the lipid in MV, discussed in a 
later section. 


C. Controlled Degradation of MV 


Additional detail regarding the fine structure of MV has been obtained 
through controlled degradation of the particle, followed by electron mi- 
croscopic analysis of subparticle fractions. Easterbrook (1966) first de- 
monstrated that treatment of MV with a neutral detergent (NP40) anda 
reducing agent (2-mercaptoethanol or dithiothreitol (DTT)) solubilizes 
the virion membrane, leaving behind an intact core with lateral bodies 
attached. This observation, coupled with the observation that MV can 
enter cells by fusion (Armstrong et al., 1973; Carter et al., 2005), demon- 
strates that the MV membrane contains a lipid bilayer, which probably 
corresponds to the innermost of the two visible domains comprising the 
MV membrane (Section II.A) (Hollinshead et al., 1999). Ichihashi et al. 
(1984) observed that treatment of MV with SDS alone yielded a “ghost” 
particle, which remained bounded by a structure that resembles the 
membrane but is reduced in thickness from 20 to 5-7 nm. This finding 
implies that the protein component(s) of the membrane comprise an 
extensive disulfide bonded network (Section VI.D.2). 

Careful titration of NP40 and 2-mercaptoethanol into suspensions of 
MV yields cores mixed with an abundance of tubular structures (Fig. 3A) 
(Wilton et al., 1995). Wilton et al. (1995) equated these structures with 
STEs. While the isolated tubules are not identical in dimension and 
structure to STEs, the overall similarity in appearance is compelling. 
After complete solubilization of the MV membrane with saturating 
amounts of NP40 and 2-mercaptoethanol, the core, with lateral bodies 
attached, can be purified from the solubilized membrane fraction by 
differential centrifugation (Fig. 3B). (It is noteworthy that most investi- 
gators refer to this NP40 and 2-mercaptoethanol insoluble fraction as the 
“core” fraction, when it in fact contains both the virion core and the lateral 
bodies.) Isolated cores no longer possess concavities and reveal clearly the 
details of the outer palisade layer of the core wall (Fig. 3C and D) 
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Fic 3. Vaccinia virion substructures. (A) Negatively stained preparation of MV after 
mild treatment with NP40 and 2-mercaptoethanol, so that surface tubule elements 
remain intact (Wilton et al., 1995). Bar, 0.2 um. (B) Isolated core in thin section. Note 
lateral bodies still attached (Ichihashi et al., 1984). (C) Isolated core visualized by cryo- 
electron microscopy (Dubochet et al., 1994). (D) Isolated core visualized by negative 
staining (Wilton et al., 1995). (A and D) Reprinted from Wilton et al. (1995) with 
permission. (B) Reprinted from Ichihashi et al. (1984) with permission. (C) Reprinted 
from Dubochet et al. (1994) with permission. 


(Dubochet et al., 1994; Easterbrook, 1966; Wilton et al., 1995). The 
palisade is composed of a lattice, sometimes hexagonal, of cylindrical 
pegs measuring approximately 5 nm in diameter and 10 nm in length 
(Dubochet et al., 1994; Easterbrook, 1966; Westwood et al., 1964). The 
lateral bodies and palisade layer sometimes spontaneously dissociate 
from a fraction of isolated cores, revealing the inner smooth layer and 
thus substantiating the two-layer character of the core wall. Treatment of 
the NP40, 2-mercaptoethanol insoluble fraction with trypsin actively 
removes the lateral bodies, suggesting that the lateral bodies are protein- 
aceous in nature (Easterbrook, 1966). Within the core resides the viral 
DNA genome. 


D. Structure of the Genome in the Core 


Investigation of the core substructure suggests that the viral 
genome exists in the core complexed with viral protein. Rupture of 
MV by various methods can release 30—40-nm diameter tubules that 
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possess an apparently helical substructure (Griffiths et al., 2001b; 
Malkin et al., 2003). The relationship, if any, between these tubules 
and the 50-nm diameter tube contained in the core (Peters and 
Mueller, 1963) is unclear. Treatment with protease reduces the 
30-nm tubes to 16-nm fibers, also helical in substructure, which coat 
the viral DNA (Malkin et al., 2003). Holowezak and coworkers, using 
various treatments with detergents and denaturing agents, isolated 
from MV DNA-containing substructures of varying complexity 
they termed “nucleoids” and “subnucleoids” (Holowczak et al., 1975; 
Soloski and Holowezak, 1980, 1981; Soloski et al., 1979; reviewed in 
Holowczak, 1982). Subnucleoids, the simplest of these structures, 
contained four proteins apparently complexed with supercoiled DNA 
in 30-60-nm spherical structures interconnected with DNA fibers, 
roughly reminiscent of a very loose nucleosome structure. In summary, 
these studies suggest that the vaccinia genome exists within the core 
complexed in an organized fashion with viral proteins. However, this 
conclusion is still controversial. As described later in Section VIII.B, 
repression or inactivation of two different proteins required for genome 
encapsidation leads to the production of MV that lack the viral ge- 
nome. These virions contain what appears to be the full complement of 
viral proteins: the absence of the encapsidated genome is not accom- 
panied by the absence of an abundant DNA-wrapping protein. It may 
be that association of the genome with viral proteins to form a nucleo- 
protein complex occurs after genome encapsidation, within the context 
of the nascent virion core. 


E. MV Proteins 


Historically, the protein composition of MV has been analyzed using 
three fundamentally distinct approaches. First, virion proteins have 
been cataloged by electrophoretic resolution of whole virions or virion 
subfractions (Essani and Dales, 1979; Ichihashi et al., 1984; Jensen 
et al., 1996; Sarov and Joklik, 1972; Takahashi et al., 1994; Wilton 
et al., 1995). Second, numerous proteins have been purified from MV 
by virtue of their enzymatic activities and subsequently characterized 
(Martin et al., 1975; Morgan et al., 1984; Niles et al., 1989). Finally, 
antibodies have been generated for individual gene products in a tar- 
geted fashion, allowing proteins to be subsequently localized to virions 
by immunoblot analysis (da Fonseca et al., 2000a, 2004). Early attempts 
at cataloging the protein content of virions by gel electrophoresis relied 
on apparent molecular weights as a means of identifying proteins. 
Because of inconsistencies in molecular weight determinations, these 
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studies are often difficult to compare with each other, and an unambig- 
uous assignment of a specific protein from the older literature to a 
discrete vaccinia gene is prone to error. More recently, the combination 
of genetics and protein sequencing techniques has made possible the 
unambiguous correlation of genes with virion proteins. We have con- 
fined ourselves in this review exclusively to those virion proteins for 
which a positive gene identification has been made. 

A complete list of the virion proteins for which a specific gene has 
been identified is given in Table I. These virion proteins can be sorted 
into four groups based on whether or not they are solubilized by NP40 
and a reducing agent and whether or not they possess enzymatic 
activity: (1) proteins that are solubilized by NP40 and a reducing agent 
but have not been assigned an enzymatic activity are presumed to be 
membrane structural proteins; (2) proteins that are solubilized by 
NP40 and a reducing agent and also possess an enzymatic activity 
are classified as membrane enzymes; (3) proteins that are not solubi- 
lized by NP40 and a reducing agent and have not been assigned an 
enzymatic activity are presumed to be core structural proteins; and (4) 
proteins that are not solubilized by NP40 and a reducing agent and 
also possess an enzymatic activity are classified as core enzymes. For 
completeness, Table I also includes two proteins (D13 and A11) that 
affect virion morphogenesis but are not encapsidated and two proteins 
(A32 and A2.5) whose subvirion localization remains to be determined. 
The list given in Table I is undoubtedly incomplete but is probably 
approaching saturation. Indeed, a comprehensive catalog of specific 
vaccinia gene products contained in MV has been compiled based on 
gel-free liquid chromatography and tandem mass spectroscopy of 
whole virions (Chung et al., 2006). These new data correlate remark- 
ably well with the data in Table I and identify an additional 10 proteins 
(C6, F8, F9, E6, I3, A6, A25, A31, A42, and A46), whose virion locali- 
zation is currently either disputed or unconfirmed by other techniques. 
Because of the possibility of nonspecific association of proteins with 
virions (Franke and Hruby, 1987), we have not included in Table I 
these 10 new candidate virion proteins and have instead focused at- 
tention on proteins whose presence in the virion has been confirmed 
by additional experiments. A few general features of the catalog of 
virion proteins are highlighted here, and otherwise the details of the 
contributions of individual proteins to structure and morphogenesis of 
the virions are detailed in the remainder of the review. 

To date, a total of 22 proteins have been localized to the virion mem- 
brane. Of these, 2 are enzymes associated with a virus coded redox 
pathway responsible for maintenance of some of the disulfide bonds in 
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the membrane, while the remaining 20 proteins can be thought of as 
structural proteins. Eighteen of the 20 structural proteins are integral 
membrane proteins containing at least one transmembrane domain. 
Interestingly, none of these proteins are glycosylated in the MV mem- 
brane, reflecting the unusual origin and structure of the membrane. 

To date, 47 proteins have been localized to the virion “core,” that is, 
the NP40, reducing agent insoluble fraction, which includes both the 
core and the lateral bodies. Of these 47 proteins, 19 have no known 
enzymatic function and are presumed to be structural proteins. Six- 
teen of the enzyme proteins have well-characterized roles in early viral 
mRNA synthesis, including initiation, elongation and termination of 
transcription, mRNA capping and polyadenylation. The remaining 
enzymes play less clearly defined roles in the viral life cycle but have 
in some cases been shown to interact with and/or modify both proteins 
and nucleic acids. It is important to note that virion enzymes may have 
structural as well as enzymatic roles in the virion. While attempts 
have been made to sublocalize proteins to core nucleoprotein, inner 
core wall, outer core wall, and lateral bodies (Ichihashi et al., 1984; 
Sarov and Joklik, 1972), these studies date from the pregenomic era 
and few if any unambiguous assignments of core proteins to core 
substructures have been made. 

Finally, several cellular proteins have been found associated with 
purified particles (Castro et al., 2003; Chung et al., 2006; Jensen et al., 
1996; Webb et al., 1999). It is possible that some of these proteins 
adhere to the MV during purification or are incorporated nonspecifi- 
cally during virus maturation (Franke and Hruby, 1987). However, 
cyclophilin A has been clearly demonstrated to be packaged into virus 
cores (Castro et al., 2003). The role of this protein in virus formation is 
not known, although it could participate in the trafficking of virus 
proteins to the factories or in the proper folding of viral proteins during 
the maturation process. 


F. Model for MV Structure 


Based on the evidence presented previously, we have constructed a 
schematic 3D model of the vaccinia virion. The model is not intended to 
be definitive in its details, especially given the continuing uncertainty 
regarding dimensions of the virion and its substructures. Neverthe- 
less, we feel that such a schematic will clarify some of the microscopic 
images available and the interrelationships of the virion components. 
For external dimensions we have used the measurements determined 
by Cyrklaff et al. (2005) using cryo-electron tomography, and for 
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internal dimensions we have used the measurements determined by 
Hollinshead et al. (1999) from cryosections (Fig. 2A). Thus the virus 
appears as a slightly flattened barrel (Fig. 4A) with overall dimensions 
of approximately 360 x 270 x 250 nm. The particle is encased in an outer 
membrane, which itself consists of two component domains. The outer- 
most membrane domain is 9 nm thick, and the innermost membrane 
domain is 5 nm thick (Fig. 4C—F). The membrane clearly contains lipid; 
however, the details of the lipid bilayer content and structure of MV are 
the subject of some debate and will be therefore be discussed at length 
later in the review (Section VI.A). Within the membrane is the core, 
which is also barrel shaped but contains two indentations, one on each 




















Fic 4. A model for vaccinia virion structure. (A) The intact MV. No attempt 
has been made to represent surface tubule elements. (B) The virion core. (C) Cutaway 
view. The membrane has been removed from the upper half of the virion, the near end 
has been removed, and the core wall has been rendered transparent, thus revealing 
the multiple layers, the concavities in the core, the lateral bodies, and the tubular 
internal structure. (D and E) Sections through the virion in three different perpendi- 
cular planes. The sections correspond to sections shown in Fig. 2. See text for 
details. (Model courtesy of Michel Moussatche. A dynamic 3D model is available online 
at http://www.vacciniamodel.com/.) 
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of the largest surfaces (Fig. 4B). The core is defined by a core wall, which 
is also composed of two layers, an outer, thicker, striated “palisade” layer 
17 nm thick, and an inner smooth layer 8 nm thick (Fig. 4C—F). Filling 
the spaces between the core wall and the membrane that are created 
by the indentations in the core are “lateral bodies.” Within the core is a 
tube-like structure, 50 nm in diameter with a 10 nm diameter core. This 
tube is folded upon itself into three continuous segments, with a total 
length of approximately 250 nm. Importantly, sections through the cen- 
ter of the particle in each of the three possible spatial planes yield three 
significantly different impressions of the overall shape and internal 
structure of the particle (Fig. 4D—F). For comparison, the electron mi- 
crographic images in Fig. 2C—E correspond to the model section shown 
in Fig. 4D; Fig. 2F—H correspond to Fig. 4E; Fig. 2I-K corresponds to 
Fig. 4F. A dynamic 3D rendering of the model can be found online at 
http://www.vacciniamodel.com/. 


IV. Vaccinia VIRUS MORPHOGENESIS: AN OVERVIEW 


The balance of this review focuses on the details of the assembly of 
vaccinia, with particular emphasis on lessons learned from the study 
of mutants affected in individual virus genes that influence virus 
morphogenesis. To establish a context for this discussion, we offer here 
a bird’s-eye view of the entire assembly process. 

The temporal sequence of events comprising vaccinia assembly was 
first deciphered experimentally by electron microscopic examination of 
cultured cells synchronously infected with virus (Dales, 1963; Dales 
and Siminovitch, 1961). At very early times following uptake of virus 
and dissolution of the core particle, infection-specific cytoplasmic do- 
mains are observed that are uniform in density, contain few if any 
cellular organelles, and are sometimes surrounded by ER derived 
cisternae (Fig. 5A and B) (Tolonen et al., 2001). These domains repre- 
sent sites of viral DNA replication, sometimes called “factories,” “viral 
factories,” or “DNA factories,” and they increase in size with time. The 
earliest evidence of virus assembly is the appearance within factories 
of rigid crescent-shaped structures, cupules in three dimensions, 
10-15 nm in thickness (Figs. 5C and 6). In most electron micrographs 
crescents comprise two distinct layers, an inner, smooth layer that at 
high resolution in fixed embedded sections has the trilamellar appear- 
ance of a lipid bilayer and an outer layer composed of regularly spaced 
projections termed “spicules” (Dales and Mosbach, 1968; Risco et al., 
2002). It is universally accepted that crescents contain at least one 
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Fic 5. Vaccinia factories. (A) An early factory, 4 hpi. The factory is represented by the 
central cleared area surrounded by cytoplasmic organelles. Bar, 1 um. (Kato and Condit, 
unpublished) (B) A factory 2 h, 45 min pi (Tolonen et al., 2001). The star indicates the factory. 
G, Golgi stack. M, mitochondria. Bar, 0.2 um. (C) A factory late during infection (Rodriguez 
et al., 1998). N, nucleus. IV, immature virions. Arrows indicate IVN. Arrowheads indicate 
MV. Note that the MV are removed from the factory. (B) Reprinted from Tolonen et al. (2001) 
with permission. (C) Reprinted from Rodriguez et al. (1998) with permission. 


lipid bilayer, although the precise membrane makeup of crescents 
has been the subject of some debate, discussed in more detail in 
Section VI.A. Crescents apparently grow in length while maintaining 
the same curvature until they become closed circles, spheres in three 
dimensions, called immature virions (IV) (Figs. 5C and 6). IV are 
approximately 350 nm in diameter and are filled with “viroplasm,” 
material that is uniform in density but discernibly more electron dense 
than the surrounding factory. Viroplasm appears not only in IV but it 
also often appears to fill the concavities defined by crescents. Viroplasm 
is also often present as relatively large sub domains, “virosomes,” within 
the surrounding factory, sometimes unbounded, sometimes surrounded 
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Fic 6. Vaccinia morphogenesis intermediates. (1) Crescents. (2) IV. (3) IVN. (4) Structures 
intermediate between IV and MV. (5) MV. Reprinted from Moss (2001) with permission. 


by multiple crescents that appear to “bite off’ portions of viroplasm. 
Appearing at approximately the same time as IV are IV which contain 
an electron dense, round or ovoid subdomain called a “nucleoid” (Figs. 5C 
and 6). We refer to IVs that contain nucleoids as “IVN.” Nucleoids contain 
DNA (Dales, 1963; Ericsson et al., 1995). Nucleoid material is frequently 
observed spanning a small gap in a nearly complete IV, so that it lies 
partially inside and partially outside of the particle (Morgan, 1976a). 
Under some conditions, larger inclusions of DNA-containing nucleoid 
material are found within factories, and may appear as either large, 
spherical, granular inclusions or alternatively paracrystalline arrays. 
Importantly, serial sections reveal that most if not all IVs contain 
nucleoids, so that in fact in a normal infection, there may be no real 
distinction between IVs and IVNs, rather, an IV is merely an IVN that 
has been sectioned through a plane that does not include the nucleoid 
(Morgan et al., 1955). Following the appearance of IVNs, MV appear 
(Figs. 5C and 6). Proteolytic cleavage of several virion protein precursors 
to a mature form accompanies, and in fact is required for, morphogenesis 
from IVN to MV (Ansarah-Sobrinho and Moss, 2004b; Katz and Moss, 
1970b). The majority of MV are found outside factories and may exist in 
clusters either at the periphery of a factory or apparently separated by a 
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significant distance from the nearest factory. Particles which have a 
structure that appears to be intermediate between IVN and MV have 
been described, but they are rare, suggesting that the transition from IV 
to MV is a rapid and concerted process (Fig. 6). 

For the purposes of this review, we have artificially divided vaccinia 
morphogenesis into stages, including formation of factories, crescents, 
IV, IVN, and MV. A summary of the entire process is shown in Fig. 7. We 
now consider the details of each of these stages separately. Critical to the 
understanding of the assembly process is the phenotypic analysis of 
infections performed with conditional lethal mutants; a complete list 
of the currently available mutants affected in genes that encode proteins 
required for virion morphogenesis or infectivity is shown in Table II. 





Fic 7. Vaccinia morphogenesis summary. Following the arrows in sequence from the 
upper left corner: Several integral viral membrane proteins (black Xs) are made in the 
ER (red lines) and transported to viral factories (gray area) along with ER derived lipid to 
be assembled into crescents which contain a lipid bilayer (red line) and the membrane 
proteins, scaffolded on a honeycomb structure composed of the D13 protein (yellow 
circles). Crescent formation is controlled by phosphorylation. The crescents mature to 
IV and IVN, accompanied by encapsidation of the genome. The dotted line surrounding 
IV, IVN, and encapsidation signifies that the order of these events is uncertain. Meta- 
morphosis to MV is accompanied by loss of the D13 scaffold, proteolysis, further addition 
of membrane proteins (blue line) and movement of particles outside of factories. MV 
acquire Golgi derived membranes (green lines) to become WV and are exocytosed 
through the plasma membrane (black line) to become EV. 
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TABLE II 
VACCINIA MORPHOGENESIS GENETICS 
Phenotype* 

Gene Identity Location Mutants Cres IV MV EV Telo Cleave DNA Notes on phenotype 
J4R rpo22 Core ts = = = Factories only 
J6R rpol47 Core ts — — z Factories only 
F10L Kinase, Core ts, ind = = = — S Factories only 

7 complex 
H5R Core ts a T = z + = Curdled virosomes or DNA 
GbR Core ts — - — — + — Curdled virosomes 
Al1R Absent ind — — — — Some normal and aberrant 

virosomes 

A17L A17-A24-A27 Membrane ind =- — — — — Virosomes, vesicles 

complex 
Al4L A17-A24-A27 Membrane ind, ts — — — = — Virosomes, vesicles 

complex 
D13L TV scaffold Absent ind, ts — — — — — Aberrant crescents, rif 

protein resistance locus 
A30L 7 complex Membrane/ ind, ts + — — — — Empty IV 

core 
G7L 7 complex Core ind, ts + = = — Virosomes + crescents (ts), or 
empty IV (ind) 
JIR 7 complex Membrane/ ind, ts + — — — + — Aberrant, partial, empty IV 
core 

A15L 7 complex Core ind + — — — — Empty IV 
D2L 7 complex Core ind, ts + — — — — Empty IV 
DƏR 7 complex Core ts + — — — — Empty IV 
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A10L 


A13L 


A32L 
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F17R 
A4L 


A3L 
GIL 


I7L 
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Aberrant, empty IV’s, DNA 
nucleoid aggregates 


No IVN, membraned DNA 
crystalloids 


Dense spherical particles 
lacking DNA 


Dense spherical particles 
lacking DNA 


Dense spherical particles 


IVN, dense spherical particles, 
viroplasm 


IVN, dense spherical particles, 
viroplasm 
IVN, DNA crystalliods 


Some aberrant, unstable IV 
IVN 


Aberrant MV 
Aberrant MV 


Aberrant MV 


Some aberrant IV 
Unstable IV, IVN 


Some MV, defective, viroplasm 
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TABLE II (continued) 




















Phenotype* 
Gene Identity Location Mutants Cres IV MV EV Telo Cleave DNA Notes on phenotype 
E10R Thiol Membrane ind + + — — — 
oxidoreductase 
G4L Thiol Membrane ind + + — — — 
oxidoreductase 
A2.5L Thiol Virion ind + + — — 
oxidoreductase 
H4L rap94 Core ind, ts + + + Noninfectious MV, transcription 
enzymes missing 
I8R RNA helicase Core ts + + + + Transcription defective MV 
H6R Topoisomerase Core ko + + + + MV show reduced infectivity, 
reduced early txn 
L4R VP8 Core ind + + + + Noninfectious, unstable, 
transcription defective MV 
HIL Phosphatase Core ind + + + Noninfectious, 
transcription defective MV 
E8R Core ts + + + + Noninfectious, 
transcription defective MV 
L3L Core ind + + + + + + Noninfectious, 
transcription defective MV 
L5R Membrane ind + + + + Noninfectious MV, fusion/entry 














defective 


T9 



























































A21L Membrane ind + Noninfectious MV, fusion/entry 
defective 

A28L Membrane ind, ts + Noninfectious MV, fusion/entry 
defective 

H2R Membrane ind Noninfectious MV, fusion/entry 
defective 

Al16L Membrane ind Noninfectious MV, fusion/entry 
defective 

G3L Membrane ts Noninfectious MV 

DPL Membrane ind + Noninfectious MV, fusion/entry 
defective 

D8L Membrane ko Reduced infectivity MV, cell 
binding 

A27L A17-A14-A27 Membrane ind, ko - Infectious MV, defective in 

complex EV formation 

A26L p4c Membrane ko + Infectious MV; empty ATI 

A14.5L Membrane ko Infectious MV, attenuated in 
animals 

EIL Core ts 

A45R Inactive SOD Core ko + Nonessential 

K4L Nicking/joining Core ko + Nonessential 

enzyme 


“ Mutant designations: ind, inducible; ts, temperature sensitive; ko, knockout; Cres, crescents; Telo, telomere resolution; Cleave, 
virion protein proteolyzed; DNA, purified particles contain DNA; +, normal structure formed; —, normal structure not formed; blank, not 
reported. 
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V. FORMATION OF FACTORIES 


Early studies by Cairns (1960) demonstrated that the number of fac- 
tories observed early during vaccinia infection was proportional to the 
multiplicity of infection and concluded that each factory observed arose 
from a single infecting particle. Factories represent sites of viral DNA 
replication: factories become labeled with DNA precursors as assessed by 
EM autoradiography, factory formation is absent or severely restricted 
following infection with mutants defective in DNA replication, and infec- 
tion with mutants that are DNA replication competent but defective in 
late viral protein synthesis results in formation of large factories that are 
lacking in crescents or any other evidence of virus morphogenesis 
(Cairns, 1960; Hooda-Dhingra et al., 1989).*, Most reports described 
early factories as unbounded in the cytoplasm (Fig. 5A) (Dales and 
Kajioka, 1964); however, recent reports suggest that at early times, 
before the appearance of any specific viral structures, factories are tran- 
siently surrounded by cellular ER-derived membrane cisternae (Fig. 5B) 
(Doglio et al., 2002; Tolonen et al., 2001; Traktman, unpublished). 


VI. CRESCENT FORMATION 


A. One Membrane or More? 


The earliest detailed descriptions of the substructure of vaccinia 
viral crescents concluded that they consisted of a single lipid bilayer 
coated on its outer surface with a layer of spicules, which conferred the 
rigid arched shape to the crescents. Furthermore, these studies con- 
cluded that the lipid bilayer of the crescent formed within factories 
without any direct connection to other existing cellular membranes 
(Fig. 8, Model 1) (Dales and Mosbach, 1968). This de novo formation 
model, although unique in biology, nevertheless stood as the accepted 
model until it was challenged in 1993 by Griffiths and coworkers 
(Sodeik et al., 1993). Central to this challenge was the commonly held 


4 A significant body of literature has made use of inhibitors of DNA replication to 
study morphogenesis. For example, hydroxyurea has been used to segregate early and 
late stages of morphogenesis (Morgan, 1976a; Pogo and Dales, 1971). It is now clear that 
in the absence of DNA replication, factory formation and therefore morphogenesis is 
inhibited. It seems probable that in reports where factory formation and some morpho- 
genesis were observed in the presence of DNA replication inhibitors, the inhibitors were 
used under conditions that were not completely inhibitory, and thus some DNA replica- 
tion and late viral protein synthesis occurred. These studies should be interpreted with 
this caveat in mind. 
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Fic 8. Models for membrane processing during vaccinia morphogenesis. In each case, a 
single solid red line represents a single lipid bilayer. DNA/nucleoid material is represented 
in gray; lateral bodies are represented in blue. Four models are shown in four columns. Each 
row represents and compares a discrete state of morphogenesis, with the top row represent- 
ing crescents, the middle row representing immature virions with nucleoids (IVN) and the 
bottom row representing mature virions (MV). Model 1: Crescents and IVN contain a single 
lipid bilayer, which becomes a single lipid bilayer of the structure surrounding MV (Dales 
and Mosbach, 1968; Hollinshead et al., 1999). In this case the core wall does not contain lipid 
and is therefore represented in black. Model 2: Crescents are derived from cisternae and 
comprise two tightly apposed lipid bilayers. The ends of cisternae fuse or are sealed to form 
IVN containing two tightly apposed bilayers. During MV formation from IVN, the inner 
bilayer separates from the outer bilayer so that the inner bilayer becomes a component of 
the core wall while the outer bilayer remains as a component of the MV membrane (Sodeik 
et al., 1993). Model 3: Crescent and IVN formation is similar to Model 2; however, the 
cisternae do not fuse but are “sealed.” During morphogenesis to MV the two bilayers do 
not separate, and thus the MV membrane contains two tightly apposed bilayers (Roos et al., 
1996). As in Model 1, the core wall does not contain lipid and is therefore represented in 
black. Model 4: Crescents are derived from cisternae as in Models 2 and 3; emphasis is 
placed on the continuity of the rigid crescent domain of this cisternae with extensive flaccid 
cisternal domains. DNA in a prenucleoid structure is associated with the flaccid cisternal 
domain. IVN is formed by the infolding of the flaccid domain within the rigid crescent 
domain. The cisterna of IVN are sealed so that the resulting MV contains four lipid bilayers, 
two forming the virion envelope and two forming the core wall (Griffiths et al., 2001a; Sodeik 
and Krijnse-Locker, 2002). 


view that all cellular membranes are derived from preexisting orga- 
nelles (Palade, 1983). Sodeik et al. (1993) proposed that crescents in 
fact were derived from preexisting cisternae of the cellular secretory 
machinery, which had collapsed on themselves through the action of 
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virus proteins. Crescents formed in this manner would consist of two 
tightly apposed bilayers rather than a single bilayer (Fig. 8, Model 2). 
(Crescents would therefore be analogous to the membranes involved in 
the development of spherical autophagosomes, which are delimited by 
a double bilayer [Reggiori and Klionsky, 2005].) Investigations into the 
consequences of this cisternae-based model for crescent formation 
ultimately resulted in three different new models for IV and MV 
formation, shown in Fig. 8, Models 2—4. In the balance of this section, 
we describe and evaluate these models. 


Model 1. Single membrane bilayer (Dales and Mosbach, 1968). As 
described previously, this model states that crescents arise de novo, 
that is, without apparent continuity with other preexisting membrane 
organelles, and that crescents contain a single lipid bilayer coated on 
its outer surface with a layer of spicules. During morphogenesis from 
IVN to MV, the spicules are lost, the surface undergoes some additional 
modifications, and the contents of IVN are reorganized to yield the core 
and lateral bodies (Essani et al., 1982; Stern and Dales, 1976). In this 
model, the core is not surrounded by a lipid membrane (Wilton et al., 
1995). The primary objections to this model are that it seems to violate 
a principle which dictates that all membranes in the cell are derived 
from preexisting organelles and that at the edges of a crescent, the 
hydrophobic core of the bilayer is left exposed to the cytoplasm, a 
theoretically unstable condition. The model is supported by the fact 
that it is consistent with some existing microscopic data and that it 
yields a single membraned MV that can enter cells by a simple fusion 
event at the plasma membrane. 

Model 2. Double membrane bilayer; membrane bound core (Sodeik 
et al., 1993). As described previously, this model states that crescents 
actually contain two tightly apposed lipid bilayers, formed by collapse 
of preexisting cellular membrane cisternae, modified by virus proteins. 
The model states further that during morphogenesis from IVN to MV, 
the two membranes separate and the innermost membrane collapses 
around the core, so that both the MV surface and the core are each now 
bounded by a single lipid bilayer. The primary theoretical objections to 
this model are that it introduces a problem of how the material making 
up the lateral bodies is transported across the innermost membrane 
during morphogenesis from IV to MV and that it would result in 
delivery of a membrane-bound core to the cytoplasm if MV were to 
enter cells by a simple fusion event at the plasma membrane. The 
model is supported by numerous electron micrographs which seem to 
show continuity between crescents and cisternae (Ericsson et al., 1997; 
Sodeik et al., 1993, 1994), and by electron microscopic images of MV 
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which seem to show a membrane surrounding the core (Cyrklaff et al., 
2005; Sodeik et al., 1993). 

Model 3. Double membrane bilayer; naked core (Roos et al., 1996). 
This model also proposes that crescents are composed of two tightly 
apposed lipid bilayers; however, it differs from Model 2 with respect to 
the fate of the innermost bilayer during the morphogenesis from IV to 
MV. Specifically, Model 3 states that the two bilayers do not separate 
during MV morphogenesis, so that the resulting MV contains two 
tightly apposed bilayers on its surface, and the core is not bound by a 
membrane. The model also states that the membranes surrounding 
the particle are not fused but rather joined in an overlap by a protein- 
aceous plug. Relative to Model 2, this model resolves the theoretical 
problem of transport of material across the innermost membrane dur- 
ing formation of lateral bodies but complicates further the problem 
of virus entry; in this case, a simple fusion event between the virion 
and the plasma membrane would leave the entire contents of the 
particle outside of the cell. In support of this model are the same 
micrographs that seem to show continuity between crescents and 
cisternae (Ericsson et al., 1997; Sodeik et al., 1993, 1994), coupled with 
experiments that indicate that core surface antigens are exposed when 
MV are disrupted with a reducing agent (Roos et al., 1996). 

Model 4. Folded cisternae (Griffiths et al., 2001a; Rodriguez et al., 
2006). In this model collapsed cisternae evolve distinct virus-specific 
domains, including a rigid crescent domain that is continuous with a 
DNA binding domain. IV and MV are formed by folding of these domains 
upon themselves, resulting in a particle which now contains, in effect, 
four lipid bilayers, two on the surface of MV and two surrounding the 
core. Like Model 3, Model 4 proposes that the membranes are not fused 
but rather joined in an overlap by a proteinaceous plug. Like Model 3, 
this model resolves the theoretical problem of transport of material 
across the innermost membrane during formation of lateral bodies, 
but leaves the same problem of virus entry described previously for 
Model 3. The model is supported by electron micrographs which 
seem to show the continuity of the cisternal domains, and by electron 
micrographs of isolated MV which are interpreted to reveal a complexi- 
ty consistent with the model (Griffiths et al., 2001a,b). 


The distinction among these four models is embodied by four ques- 
tions relating to crescent, IV and MV structure and assembly: (1) Do 
the crescents contain two membranes? (2) Are the crescents continu- 
ous with cisternae, and is this continuity theoretically necessary? (3) 
Are nucleoids or cores bounded by membranes? (4) How does the virus 
enter cells? Each of these questions is addressed separately below. 
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1. Do Crescents Contain Two Membranes? 


The structure of viral crescents has been investigated by various 
methods of electron microscopy, with conflicting interpretations of the 
results. The overall thickness of crescents including the inner smooth 
later and the outer “spicule” layer is 10-15 nm, at least enough to 
accommodate two tightly apposed bilayers. Occasional micrographs 
show that these layers are physically distinct in that they sometimes 
appear separated (Dales and Mosbach, 1968; Essani et al., 1982; 
Hollinshead et al., 1999; Sodeik et al., 1993). Some investigators inter- 
pret the outermost layer to be devoid of lipid membrane and conclude 
that crescents contain only one membrane corresponding to the inner- 
most layer (Dales and Mosbach, 1968; Essani et al., 1982; Hollinshead 
et al., 1999). Other investigators conclude that both layers represent 
lipid bilayers and that the spicules can obscure the outermost lipid 
bilayer in conventional electron micrographs (Risco et al., 2002; Sodeik 
et al., 1993). Freeze fracture of multiply membraned organelles often 
clearly reveals the individual membranes as steps in the fracture pat- 
tern. While one freeze fracture study of vaccinia IVs suggests a double 
membraned particle (Risco et al., 2002), most freeze fracture studies fail 
to reveal evidence of more than one membrane in IV (Heuser, 2005). 

Electron microscopy of IVs formed after reversal of a rifampicin block 
provides unique insight into the formation of crescents. As detailed in 
Section VIE, it is now clearly established that the major component of 
the spicule layer is a lattice of the D13 protein. In the presence of the 
antibiotic rifampicin, the association of D13 protein with viral mem- 
branes is prevented, so that flaccid viral membranes accumulate at the 
periphery of virosomes while D13 protein accumulates within viroplas- 
mic inclusions distant from the membranes. While some investigators 
describe only single lipid bilayers accumulating in the presence of 
rifampicin (Grimley et al., 1970), others describe both single mem- 
branes and cisternae (Sodeik et al., 1994), or “dense membranes 
(around 18 nm thick), twisted double membranes with vesicular ends, 
and tubular elements (30 nm thick), some of them with vesicles at one 
end” (Risco et al., 2002). Removal of rifampicin results in formation of 
rigid crescents within minutes, presumably via association of accumu- 
lated D13 with viral membranes. Under conditions of rifampicin rever- 
sal, multiple crescents can form on one continuous membrane, 
providing a direct comparison of crescents with spicule-free viral mem- 
branes. Under these circumstances, clear images have been obtained 
which suggest that the crescent membrane is a single lipid bilayer and 
that the spicule layer is attached in patches along this unit membrane 
(Fig. 9) (Grimley et al., 1970). 
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Fic 9. Evidence that crescents contain a single lipid bilayer. Electron micrograph of an 
infected cell 10 min following reversal of a rifampicin block (Grimley et al., 1970). Two 
crescents are connected by a continuous membrane, which has the dimensions and 
appearance of a single lipid bilayer. Reprinted from Grimley et al. (1970) with permission. 


In summary, while electron micrographs are subject to different 
interpretations, the bulk of the evidence, in particular from freeze 
fracture microscopy and rifampicin reversal, favors the model that 
crescents contain a single lipid bilayer formed beneath a scaffold 
composed predominantly of D13 protein. 


2. Are the Crescents Continuous with Cisternae, and is this Continuity 
Theoretically Necessary? 


Much has been made of the use of the term de novo to describe 
vaccinia viral membrane biogenesis. To some the appearance of mem- 
branes without continuity with other intracellular membranes is a 
heretical violation of biological principle, while to others it is simply 
another amusing example of the inventiveness of nature. Numerous 
electron micrographic studies have drawn conclusions on either side of 
this debate. On the one hand, for example, serial sections of crescents 
have been published which show no continuity between crescents and 
other membranes, and in the same study tilt series electron micro- 
graphs reveal that crescents which are in fact not continuous with 
cisternae, may appear to be in continuity when viewed at an appropri- 
ate angle, raising the possibility of artifact (Hollinshead et al., 1999). On 
the other hand, numerous, highly convincing electron micrographs 
exist which seem to show crescents in continuity with membranes or 
cisternae, or crescents with membranous extensions or loops at the ends 
(Ericsson et al., 1997; Griffiths et al., 2001a; Risco et al., 2002; Sodeik 
et al., 1993, 1994). In fact, the de novo argument may be irrelevant. 
Based on evidence cited in Section VI.B, a reasonable model for viral 
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membrane biogenesis is that lipid is delivered to factories from the ER 
as vesicles complexed with viral protein and that crescents may form 
and grow via addition of lipid and viral protein from these vesicles. 
Assuming the vesicles are relatively small and short lived, any number 
of structures may be visible in viral factories, including vesicles, tu- 
bules, crescents with no apparent continuity with other membranes, or 
crescents with long, short, or looped extensions at their ends. Further- 
more, all these structures may be subject to variation in sample prepa- 
ration and interpretation. Formation of a crescent, which contains a 
double bilayer, would be the obvious outcome of vesicle fusion. However, 
if crescents contain a single bilayer, then some mechanism must exist 
for building a single bilayer from a vesicle, tubule, or cisterna, and some 
“capping” mechanism must exist to sequester the hydrophobic core of 
the single membrane bilayer from the aqueous environment of the 
cytoplasm. Heuser (2005) suggests one possible mechanism for forma- 
tion of a single bilayer from virtually any other membrane, based on a 
current “T junction” model for lipid droplet formation (Fig. 10). Capping 
of the crescent ends might be accomplished by a novel, yet to be discov- 
ered viral gene product. Alternatively, the lateral edges of the crescent 
membrane may contain phospholipids which assume a curved, micellar 
formation rather than a true lipid bilayer, such that only polar head 
groups are in contact with the cytoplasm. 

In summary, it is possible that growing crescents may appear either 
continuous or discontinuous with other membranes; however, the pres- 
ence of absence of such continuities does not necessarily dictate wheth- 
er the crescent and IV membrane is a single bilayer or two tightly 
apposed bilayers. 


3. Are Nucleoids or Cores Bounded by Membranes? 


As described previously, two of the double membrane models require 
that the core be surrounded by at least one membrane; therefore, the 
question of whether or not the core is bounded by a membrane is 
relevant to distinguish between one and two membrane models for 
crescent and IV assembly. In common with other aspects of the debate, 
much of this discussion rests on electron microscopic data, which is 
subject to varied interpretation. For example, in cryo-electron micro- 
graphs, the inner layer of the core wall appears electron lucent or 
“white,” consistent with a lipid bilayer (Fig. 2A). While some authors 
interpret this layer as lipid (Cyrklaff et al., 2005; Sodeik et al., 1993), 
others conclude that its dimensions are inconsistent with a lipid bi- 
layer (Hollinshead et al., 1999). Similarly, while some images are 
interpreted to reveal a complex folding of membranes in developing 
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Fic 10. Model for poxvirus membrane growth. (Top) Model for lipid droplet formation 
from ER. (Bottom) Model for vaccinia crescent formation based on the lipid droplet 
formation model (Heuser, 2005). A crescent membrane, represented by a bilayer studded 
with lollipops, is extruded in a T junction from another bilayer, which could be part of any 
membrane structure. The lollipops represent a viral protein that directs the process. 
Reprinted from Heuser (2005) with permission. 


or mature particles consistent with the folded cisternae model (Model 4) 
(Griffiths et al., 2001a,b; Sodeik et al., 1993), numerous other images 
show no evidence of membrane boundaries within IV, IVN, or MV and no 
surface folds or continuity between internal and external boundaries in 
MV (Dales and Mosbach, 1968; Dales and Siminovitch, 1961; Essani 
et al., 1982; Heuser, 2005; Hollinshead et al., 1999; Mohandas and 
Dales, 1995; Morgan, 1976a,b; Risco et al., 2002). Perhaps the most 
compelling data relevant to the lipid content of cores consists of exami- 
nation of isolated cores. As described previously, cores are typically 
isolated by dissociating the MV membrane with a neutral detergent 
(NP40) and DTT. This treatment completely dissolves the MV mem- 
brane and should therefore extract any lipid from the cores, and yet 
the structure of the isolated core remains essentially the same as 
the core found in intact MV (Dubochet et al., 1994; Easterbrook, 
1966; Ichihashi et al., 1984; Wilton et al., 1995). Isolated cores lose the 
concavities present in mature MV and assume a more regular barrel 
shape but nevertheless maintain a bilaminar structure with an inner, 
smooth layer and an outer palisade layer, virtually identical to the core 
present in mature MV. If the core were to be encased in one or two lipid 
bilayers, then this lipid would have to be resistant to detergent extrac- 
tion sufficient to dissolve the membrane, or the lipid would have to 
contribute very little to the structure of the core, so that its absence 
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would not affect the gross structure of the core. Both of these latter 
possibilities seem unlikely. 

In summary, the weight of the evidence seems to favor the conclu- 
sion that the core is not bounded by a lipid bilayer. 


4. How Does the Virus Enter Cells? 


As noted previously, any of the two membrane models present pro- 
blems for entry of virus into cells. If Model 2 were correct, a simple 
fusion of the outermost viral membrane with the plasma membrane 
would deliver a membraned subparticle to the cell cytoplasm, which 
would then require further uncoating. If Model 3 or 4 were correct, then 
fusion of the outermost lipid bilayer of MV with the plasma membrane 
would leave the entire contents of the particle outside of the cell. While 
novel models for entry of a multiply membraned particle have been 
proposed, backed by some electron microscopy (Griffiths et al., 2001a; 
Locker et al., 2000; Pedersen et al., 2000), the best evidence to date 
shows that MV enters cells by a simple fusion event between the virus 
membrane and the plasma membrane, releasing the core and lateral 
bodies into the cell cytoplasm (Carter et al., 2005). Therefore, the mech- 
anism of entry of the virus particle favors the single membrane model 
(Model 1) for vaccinia crescent, IV, and MV formation. 

In summary, while no single experiment necessarily provides an 
incontrovertible distinction among the models proposed for the mem- 
brane structure of maturing and mature vaccinia virus, taken together 
the data strongly favor the original one membrane model. This leaves 
open several interesting issues to be resolved, foremost being the 
mechanism by which a single bilayer may be fabricated and stabilized 
using virus-modified cellular precursors in the viroplasm. For the 
balance of this review, we will use the single membrane model in 
discussing vaccinia structure and assembly. 


B. Source of the Virion Membrane 


Few topics in vaccinia morphogenesis have been as controversial as 
the source of the virion membrane (Sodeik and Krijnse-Locker, 2002). 
As described previously, de novo biogenesis was initially favored, first 
because no ultrastructural continuity with intracellular membranes 
could be observed and second, because the lipid content of the virion 
membrane was distinct from known cellular membranes (Dales and 
Mosbach, 1968). However, no plausible mechanism for the de novo 
assembly of a membrane within the cytosol has been proposed. The 
second model for membrane biogenesis was diversion of membranes 
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from a component of the secretory apparatus. Although none of the 
membrane proteins within MV were glycosylated, it was assumed that 
they were synthesized in the ER. This assumption followed first from 
the findings that several of the major membrane proteins (A14, A17) 
(Section VI.D) could be cotranslationally inserted into microsomal 
membranes in vitro, and second from ultrastructural observations. 
Indeed, the ultrastructural observations initially seemed to support a 
colocalization of the A17 and A14 proteins with the cellular Rab1 
protein, which was thought to be a specific marker of the ER/Golgi 
intermediate compartment (ERGIC) (Sodeik et al., 1993). Thus, deri- 
vation of the virion membrane from the ERGIC was assumed for a 
number of years. However, experiments involving pharmacological 
inhibitors (brefeldin A, nordihydroguiaretic acid) or dominant negative 
variants of cellular proteins (Sarlp) have argued against the role of 
either the COPI or COPII trafficking pathways in MV biogenesis 
(Husain and Moss, 2008; Ulaeto et al., 1995; Traktman, unpublished). 
These two vesicular trafficking pathways are the crux of the ERGIC 
system. Consequently, it seems more likely that a specialized compart- 
ment of the ER is the initial source of the virion membrane. Consistent 
with this idea, H89, a known inhibitor of ER exit site formation, and 
cerulenin, an inhibitor of de novo lipid biosynthesis, have been shown 
to block the recovery of virion morphogenesis in cultures released from 
an early morphogenesis block (Punjabi and Traktman, 2005). 

If the MV membrane were composed of two lipid bilayers, it would be 
relatively straightforward to envision how vesicular elements could 
fuse to form growing crescents and remain stable in the cytoplasm. 
However, diversion of a single lipid bilayer from the ER in order to 
form nascent crescents, stabilization of the exposed fatty acid chains in 
these crescents within the cytoplasm, and enlargement of these cres- 
cents to form the IV membrane, remains a conundrum. Lipid synthesis 
is normally centered in the smooth endoplasmic reticulum, and in 
recent studies delivery of lipids to existing organelles have been fo- 
cused on the subcellular compartments, plasma membrane associated 
membranes (PAM), and mitochondrial associated membranes (MAM) 
(Gaigg et al., 1995; Pichler et al., 2001). Gaining an understanding of 
how the virus accomplishes membrane biogenesis will be a fascinating 
challenge. 


C. Regulatory Proteins in Viral Membrane Formation 


The mechanism by which crescents form remains largely unknown. 
However, cumulative genetic data implicate four viral proteins in 
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regulating the earliest steps of this process. These are encoded by the 
F10L, H5R, G5R, and A11R genes. 


1. F10 


The F10 protein was first identified as the major protein kinase 
encapsidated in the virion core (VPK2) (Lin and Broyles, 1994). Re- 
combinant F10 has enzymatic activity in vitro, indicating that the 
protein possesses intrinsic kinase activity and does not need additional 
protein cofactors (Lin and Broyles, 1994; Traktman et al., 1995; Wang 
and Shuman, 1995). The predicted amino acid sequence of F10 retains 
some of the conserved sequence motifs associated with ATP binding or 
phosphotransfer, but other motifs cannot be recognized due to the 
divergence of the F10 sequence from that of the majority of protein 
kinases. F10 has been shown to direct phosphorylation of serine, thre- 
onine and tyrosine residues, indicating that it is a dual specificity 
kinase (DSP) (Derrien et al., 1999). The 50-kDa protein is expressed 
late during infection and undergoes autophosphorylation but does not 
appear to be proteolytically processed despite having the canonical 
AG|X motif associated with the virally encoded proteolytic processing 
machinery (Section IX.C) (Punjabi and Traktman, 2005). F10 is known 
to be part of a seven-protein complex implicated in the filling of grow- 
ing crescents with viroplasmic proteins (Section VII.A), but it has also 
been shown to be tightly associated with membranes during infection 
and to bind to some lipid species (phosphoinositide phosphates) in vitro 
(Punjabi and Traktman, 2005; Szajner et al., 2004a). F10 is itself 
encapsidated in nascent virions, with an average abundance of ~300 
copies/core. 

The importance of F10 during infection was foreshadowed by 
the inability of investigators to generate a viable virus in which the 
F10L gene had been deleted. Subsequent studies identified four 
temperature-sensitive mutants as encoding mutant alleles of F10L; 
cells infected with these viruses show a profound defect in virion 
morphogenesis at the nonpermissive temperature (Traktman et al., 
1995; Wang and Shuman, 1995). This phenotype has been recapitu- 
lated by engineering viruses in which F10 expression is dependent on 
inclusion of IPTG or tetracycline in the culture medium (Punjabi and 
Traktman, 2005; Szajner et al., 2004b). The latter studies also con- 
firmed that expression of a catalytically active F10 protein is essential 
for virus morphogenesis. The first arrest observed on inactivation of 
F10 occurs prior to the formation of any signs of membrane biogenesis. 
Although the late viral proteins accumulate normally and factories are 
formed, virosomes are either absent or malformed, and no crescents, 
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IV, IVN, or MV are observed (Fig. 11A). Under these circumstances, 
there is also no phosphorylation of the major membrane proteins A14 
or A17 (Section VI.D). When cells infected with tsF10 mutants under 
nonpermissive conditions were shifted to the permissive temperature 
at 12 hour postinfection (hpi), membrane biogenesis resumed, and 
crescents were observed within 45 min. Execution point studies have 
shown that the F10 kinase is also required for later steps in morpho- 
genesis; under certain experimental protocols, tsF10-infected cells 
show a defect in crescent filling that it is nearly indistinguishable from 
that observed when other proteins found within the seven-protein 
complex are repressed or inactivated (Section VII.A). Identification of 
viral and cellular proteins whose phosphorylation by F10 is important 
in morphogenesis is an area of intense interest. Although the phos- 
phorylation of a number of viral proteins either fails to occur during 
nonpermissive infections with tsF10 mutants (e.g., A14, A17, A30, 
G7) or is diminished (H5, F17), the phosphorylation of at least some 
of these proteins has been shown to be blocked when morphogenesis is 
arrested by other means. Thus, it is not clear which phosphorylation 
events are directly mediated by F10, and which may be mediated 
by other viral or cellular kinases in the context of productive morpho- 
genesis. Phosphorylation of at least H5 and A30 has been verified 
in vitro. 

In summary, studies with mutants in FIOL suggest that virus- 
mediated phosphorylation plays a critical role throughout MV assembly, 
starting from the very earliest events in virus membrane formation. It is 
important to reiterate that there is no evidence of membrane diversion or 
remodeling when F10 is repressed or inactivated. 


2. H5 


The H5 protein has a predicted MW of 22 kDa but has an electro- 
phoretic mobility consistent with a MW of 35 kDa, due to a proline-rich 
domain within the N-terminus (Gordon et al., 1991). The protein is 
expressed constitutively and is encapsidated at ~2500 copies/core 
(Traktman, unpublished). Some of the early literature on the H5 pro- 
tein is misleading because an antiserum purported to show specificity 
for H5 was in fact quite reactive against the comigrating H3 protein. 
The bona fide role of H5 during the viral life cycle has been confusing 
because genetic and biochemical studies have implicated H5 in several 
processes. The protein has an amphipathic helix and is phosphorylated 
in vivo; H5 can be phosphorylated by both vaccinia-encoded kinases 
(B1 and F10) as well as by cellular kinases (Beaud and Beaud, 2000; 
Beaud et al., 1995; Brown et al., 2000; Traktman, unpublished). 
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Fic 11. Representative morphogenesis arrests seen during nonpermissive infections 
with mutants affected in genes involved in MV formation. (A-C) Mutants affecting 
membrane formation. (D-F) Mutants affecting IV formation and encapsidation. (G-I) 
Mutants affecting core formation. (A) Formation of factories, but no virosomes and no 
evidence of morphogenesis: tsF10 mutant (Traktman et al., 1995); (B) accumulation of 
virosomes, membrane vesicles, and a few aberrant crescents: inducible A14 mutant 
(Traktman et al., 2000); (C) formation of virosomes and accumulation of membrane 
vesicles: inducible A17 mutant (Punjabi, Mercer, and Traktman, unpublished); (D) accu- 
mulation of empty IV and virosomes: tsG7 mutant (2nd arrest) (Mercer and Traktman, 
2005); (E) formation of IV and DNA crystalloids due to arrest prior to genome encap- 
sidation: inducible A13 mutant (Unger and Traktman, 2004); (F) accumulation of IV, 
DNA crystalloids and aberrant MV due to lack of genome encapsidation: tsI6 mutant 
(Grubisha and Traktman, 2003); (G) accumulation of aberrant IV due to absence of 
nucleoid and/or core formation: inducible F17 mutant (Mercer and Traktman, unpub- 
lished); (H) accumulation of IV and aberrant particles due to lack of core formation; tsI7 
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An interaction between H5 and A20, a stoichiometric component of the 
processive DNA polymerase, has been found by yeast two hybrid anal- 
ysis, thus implicating H5 in DNA replication (Ishii and Moss, 2002; 
McCraith et al., 2000). Consistent with this finding, one of the 
temperature-sensitive mutants from the Dales collection, which has 
been the subject of recent systematic analysis, has been shown to have 
a lesion within the H5 gene, and preliminary analysis of this mutant 
reveals a defect in DNA replication (D’Costa, Prins, and Condit, un- 
published; Lackner et al., 2003). Two lines of evidence also implicate 
H5 in the process of viral transcription: H5 stimulates late gene 
transcription in vitro, and H5 has been shown to interact with proteins 
which regulate late gene transcription (G2, G8, A2) (Black et al., 
1998; Dellis et al., 2004; Kovacs and Moss, 1996). However, a different 
temperature-sensitive mutant, generated by targeted alanine- 
scanning mutagenesis of the H5R gene, implicates H5 in virus assem- 
bly. This mutant, which has a dominant phenotype, is unimpaired in 
its ability to direct viral gene expression but shows a profound defect 
in virion morphogenesis (DeMasi and Traktman, 2000). The phenotype 
is quite similar to that seen in tsF10 mutants: virosomes are either 
absent or malformed, and no signs of membrane biogenesis are seen. 
Whether H5 plays a direct role in morphogenesis, or whether the 
mutant H5 protein somehow sequesters F10 and hence arrests mor- 
phogenesis, is not known. Because H5 is so abundant, it is also possible 
that it serves as a scaffold for transcription, replication, and virion 
assembly, thus accounting for the apparent diversity of roles for the H5 
protein during infection. 


3. G5 


The 50-kDa G5 protein is expressed at early times postinfection and is 
encapsidated in the virion core, although the abundance of the protein 
within virions has not been reported (da Fonseca et al., 2004). 
The essentiality of the G5R gene is suggested by its conservation 
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mutant (Kane and Shuman, 1993); (I) accumulation of aberrant MV due to defect in core 
wall formation: tsA3 mutant (insets are wrapped aberrant MV [lower left] and extracel- 
lular aberrant MV [mid right]) (Kato et al., 2004). (A) Reprinted from Traktman et al. 
(1995) with permission. (B) Reprinted from Traktman et al. (2000) with permission. (D) 
Reprinted from Mercer and Traktman (2005) with permission. (E) Reprinted from Unger 
and Traktman (2004) with permission. (F) Reprinted from Grubisha and Traktman 
(2003) with permission. (H) Reprinted from Kane and Shuman (1993) with permission. 
(I) Reprinted from Kato et al. (2004) with permission. 
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within most if not all poxviral genomes; attempts to isolate a virus 
deleted for the G5R ORF have been unsuccessful. Temperature-sensi- 
tive mutants carrying G5R lesions were isolated by targeted alanine 
scanning mutagenesis, and phenotypic analysis of two such mutants 
revealed a role for GSR in the earliest stages of virion morphogenesis 
(da Fonseca et al., 2004). As was seen in tsF10 and tsH5 infections, 
factories were formed during tsG5 infections; however, electron-dense 
virosomes were either absent or malformed, and, most importantly, no 
signs of membrane biogenesis were seen. 


4. All 


The 40-kDa A11 protein is expressed at late times postinfection and 
is phosphorylated in an F10-independent manner (Resch et al., 2005). 
An inducible recombinant in which A11 expression is dependent on 
inclusion of IPTG in the culture medium has been generated; repres- 
sion of AJ1R transcription has a profound impact on virus production 
and leads to an early arrest in virion morphogenesis. This is a striking 
observation, since A11 itself does not appear to be encapsidated and is 
therefore the first nonstructural protein to be identified as playing an 
essential role in the biogenesis of the virion membrane. In the absence 
of A11, factories were formed; however, no crescents, IV, IVN, or MV 
were seen. Electron-dense virosomes were observed, as were unusual 
areas of intermediate electron-density that were associated with mem- 
branous structures containing markers of the endoplasmic reticulum. 
These areas of intermediate density also contained the D13 crescent 
scaffold protein (Section VI.E). 


D. Membrane Proteins Essential for Membrane Biogenesis: 
A14 and A17 


The proteins encoded by the A14L and A17L genes are abundant 
components of the MV membrane, and both are essential for its 
biogenesis. 


1. A17 


The 203-aa A17 protein is expressed at late times of infection. The 
protein has been shown to undergo cotranslational insertion into mi- 
crosomal membranes in vitro and to associate with ER membranes 
in vivo (Krijnse-Locker et al., 1996). In the context of its association 
with microsomal membranes, A17 spans the membrane twice, with 
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the N- and C-termini extending into the solution and an internal 
hydrophilic loop extending into the lumenal space (Fig. 12) (Betakova 
et al., 1999a). An intramolecular disulfide bond is formed within this 
central loop (Betakova and Moss, 2000). A fraction of A17 proteins 
appear to form intermolecular disulfide bonds through a Cys residue 
found in the cytosolic C-terminal tail. 

The N- and C-termini of A17 are modified by proteolytic cleavage 
events, which occur at AG16|A and AG185|A motifs (Section IX.C) 
(Fig. 13). These cleavages occur earlier than those associated with 
maturation of the major core proteins and do not depend on the IV to 
MV transition, since they occur in rifampicin-arrested cultures in 
which morphogenesis is blocked prior to IV formation (Section VI.E). 
However, during the normal morphogenetic pathway, immunoKM 
analysis suggests that loss of the C-terminal tail, at least, occurs 
largely after IV are formed. The N-terminal hydrophilic tail of the 
mature form of A17 (aa 17-60) is accessible to antibodies and to 
protease in intact MV, confirming the surface localization of the pro- 
tein; the hydrophilic tail at the C-terminus is only predicted to be 23 aa 
long after proteolytic cleavage, which may limit its accessibility to 
these reagents (Betakova et al., 1999b; Wallengren et al., 2001). 

A17 also undergoes phosphorylation on Ser, Thr, and Tyr residues in 
an F10-dependent manner (Betakova et al., 1999b; Derrien et al., 1999). 
Phosphorylation of A17 fails to occur if morphogenesis is arrested prior 
to normal crescent formation. Tyr phosphorylation has been shown to 
occur on Tyr203, the terminal amino acid in the protein; conflicting 
results obtained with different pTyr antibodies has obscured the ques- 
tion of whether other Tyr residues also undergo modification. The 
Ser/Thr phosphorylation appears to occur solely in the C-terminal 
hydrophilic tail of the protein that is removed during morphogenesis. 

Inducible recombinants in which A17 expression is dependent on 
inclusion of IPTG in the culture medium have been generated; A17 is 
essential for virion production (Rodriguez et al., 1995, 1996; Wolffe 
et al., 1996). In the absence of A17, vast numbers of membrane vesicles 
accumulate at the periphery of electron dense virosomes (Fig. 11C). 
Thus, A17 does not appear to be essential for the F10-dependent 
process of membrane diversion but is essential for crescent formation. 
The A17 protein has been shown by coimmunoprecipitation analysis to 
interact with the A14 protein during productive membrane biogenesis 
(Betakova et al., 1999b; Mercer and Traktman, 2003; Rodriguez et al., 
1997); this interaction is mediated by the N-terminus of A17 (Unger, 
Mercer, and Traktman, submitted for publication). 
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2. Al4 


The 90-aa A14 protein, which interacts with the A17 protein, is 
another major component of the MV membrane. A14 undergoes cotran- 
slational insertion into microsomal membranes in vitro and has been 
shown to associate with ER membranes in vivo (Rodriguez et al., 1997; 
Salmons et al., 1997). The protein appears to span the membrane twice, 
assuming a topology in which the N- and C-terminal tails are lumenal 
and the central hydrophilic loop is exposed (Fig. 12). The C-terminal tail 
contains an N83HS motif, which is readily glycosylated in vitro but 
poorly glycosylated in vivo during wt infections (Mercer and Traktman, 
2003). Moreover, none of the glycosylated A14 protein appears to be 
encapsidated, suggesting that interactions of A14 with other viral pro- 
teins, or further posttranslational modifications (see later) may prevent 
glycosylation of A14. Alternatively, the majority of A14 may be synthe- 
sized in, or rapidly diverted to, a membrane domain that does not 
support glycosylation. This latter hypothesis is supported by observa- 
tions that the fraction of A14 that undergoes glycosylation is signifi- 
cantly higher when membrane biogenesis is prevented at an early stage 
by repression or inactivation of F10, H5, A17, or G7. 

A14 undergoes F10-dependent phosphorylation on Ser85, which is 
also part of the glycosylation motif discussed previously (Mercer and 
Traktman, 2003). Disruption of the glycosylation motif increases the 
level of A14 phosphorylation, suggesting that the two posttranslation- 
al modification systems are competing for access to the protein. Final- 
ly, the vast majority of A14 dimerizes by forming intermolecular 
disulfide bonds through Cys71. A mutant allele of A14 in which 
Cys71 has been altered to Ser is viable, although viral yields are 
reduced approximately fivefold. Virions containing this altered A14 


LN 


Fic 12. Schematic view of integral MV membrane proteins. The overall topology of 
known transmembrane proteins contained within the MV membrane is shown; external 
and internal refers to the orientation within MV. A14 and A17 are essential for biogene- 
sis; A13, A9, and L1 are required for virion maturation; H3 stimulates virion maturation 
and, along with D8, binds to glycosaminoglycans on the surface of target cells; G3, A21, 
L5, J5, A28, H2, G9, and A16 comprise an eight-member fusion/entry complex; 12 is also 
required for virion entry; A14.5 affects virulence in mice, and the function of I5 has not 
been determined. The number of amino acid residues is shown beneath each protein. 
Sites of phosphorylation, PO myristylation (chemical structure), proteolytic cleavage 
(dotted line), and disulfide bond formation (-S-) are shown. The symbol shown adjacent 
to the H2 and A16 proteins  indicaes that, although these proteins are known to contain 
intramolecular disulfide bonds, the number and position of these linkages are not known. 
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protein are exquisitely sensitive to nonionic detergents. Under condi- 
tions in which wt virions remain intact and infectious, NP40 treatment 
of these A14-Cys71 virions leads to the extraction of both A14 and A17 
and causes a 1000-fold decrease in specific infectivity. These findings 
support a model in which dimerized A14, interacting noncovalently 
with A17, forms a lattice within the MV membrane that sustains virion 
integrity, consistent with previous results from controlled degradation 
of MV described in Section II.C. 

Recombinant viruses in which expression of A14 is dependent on 
inclusion of either IPTG or tetracycline in the culture medium have 
been generated; expression of A14 is essential for virion production 
(Rodriguez et al., 1998; Traktman et al., 2000). In the absence of A14, 
vast numbers of vesicles (and/or tubules) accumulate within the cyto- 
plasm (Fig. 11B). Some aberrant crescents are seen which are adjacent 
to, but fail to make contact with, electron-dense virosomes that accu- 
mulate in factories. Importantly, the vesicles formed in the absence of 
A14 contain A17 and another membrane protein, D8, while the viro- 
somes formed contain the core proteins L4 and F17 (Traktman et al., 
2000). Thus, the A14 mutant phenotype is similar but not identical to 
that seen upon repression of A17. Most importantly, the presence of 
both of A14 and A17 is required to form bona fide crescents, which can 
progress to form the IV membrane. Whether the vesicles seen upon 
their repression represent stalled intermediates, or aberrant struc- 
tures formed by the collapse of unstable membranes, has not been 
determined. This is a question of considerable relevance to a fuller 
appreciation of the mechanism of IV membrane biogenesis. 

Transient complementation has been used to probe which features of 
the A14 protein are essential for membrane biogenesis. Sequences that 
flank the first transmembrane domain, and those within the external, 
hydrophilic loop are most sensitive to mutation. The latter region has 
been found to interact with the N-terminus of A17 in vitro; moreover, 
mutations that disrupt complementation activity in vivo disrupt 
interaction in vitro (Unger, Mercer, and Traktman, submitted for 
publication). 

In summary, the A17 and A14 membrane proteins are critical 
players in diversion of ER membranes to viral factories and formation 
of the viral membrane. The proteins are synthesized in the ER and 
trafficked to viral factories to form membrane-containing crescents. 
Within crescents and the mature virion membrane, A14 and A17 
appear to form a lattice that is stabilized by disulfide bonds, probably 
contributing to the sensitivity of the virion membrane to reducing 
agents. Finally, the A14—A17 lattice serves as an anchor within the 
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viral membrane to which several other proteins important in virion 
structure and morphogenesis attach (see in a later section). 


E. D18: The Crescent Scaffold Protein 


The “spicule” layer that comprises the convex surface of crescents, 
IV and IVN was first described in detail by Dales and Mosbach (1968). 
Based on studies with metabolic inhibitors that partially disrupted 
crescent formation these authors concluded that the spicule layer con- 
ferred curvature and rigidity to the crescents. This conclusion was 
confirmed and extended through studies with the antibiotic inhibitor 
rifampicin. Rifampicin, better known as an inhibitor of prokaryotic 
transcription, inhibits vaccinia virus morphogenesis (Grimley et al., 
1970; Moss et al., 1969a,b; Nagayama et al., 1970). In the presence of 
this drug, irregularly shaped membranes that lack a spicule layer 
accumulate around areas of electron-dense viroplasm. These mem- 
branes can be labeled with anti-A17 antibody using immune electron 
microscopy, confirming a relatedness to virus crescent membranes 
(Wolffe et al., 1996). Inhibition with rifampicin is rapidly reversible; 
normal crescents form within minutes after the removal of drug. 

Several lines of evidence demonstrate that the spicule protein is the 
product of gene D13L. First, immune electron microscopy shows that 
in a normal infection, antibody specific for the D13 protein decorates 
crescents and IV (Miner and Hruby, 1989; Mohandas and Dales, 1995; 
Sodeik et al., 1994; Szajner et al., 2005). In the presence of rifampicin, 
virus membranes do not label with D13 antibody, and instead D13 
antigen accumulates in inclusions within factories, removed from 
membranes. When the rifampicin block is reversed, newly formed 
crescents are now labeled with D13 antibody and the D13 inclusions 
disappear. Second, mutations that confer resistance to rifampicin map 
to the D13L gene (Baldick and Moss, 1987; Tartaglia and Paoletti, 
1985). Third, conditional lethal mutations in the D13L gene display a 
phenotype identical to the effects of rifampicin (Lackner et al., 2003; 
Stern et al., 1977; Zhang and Moss, 1992). 

Experiments demonstrate that the “spicule” layer is in fact a spheri- 
cal honeycomb lattice composed of D13 trimers (Heuser, 2005; Szajner 
et al., 2005). The D13 protein itself can associate with two integral 
virus membrane proteins, A17 and H3, thus providing a mechanism 
for scaffolding of the virus membrane onto the D13 lattice. As dis- 
cussed in Section IX.E.1, D13 is lost from virions during maturation 
from IVN to MV (Essani et al., 1982; Mohandas and Dales, 1995; 
Sodeik et al., 1994); how the completion of the IV membrane sends a 
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signal that activates dissolution of the D13 lattice, and how this lattice 
is dismantled, are intriguing questions that remain to be answered. 
In summary, the “spicule” layer that appears on the convex surface of 
crescents and the outside of IV and IVN is a temporary scaffold for 
assembly of the virus membrane and is composed primarily of a honey- 
comb lattice built from D13 protein trimers. It follows that the curva- 
ture of the IV membrane, and therefore the final size of IV, is largely 
determined by the structure of the D13 lattice. Parallels between the 
D183 lattice and the clathrin coats of endocytic vesicles are evident. 


VII. IMMATURE VIRION FORMATION: A COMPLEX PROCESS 


Mutants in seven different vaccinia genes produce a similar pheno- 
type which suggests a role for the gene products in the association of 
viral crescents with viroplasm. One of these seven proteins, A10 or 4a, 
is one of the most abundant virion core proteins. The other six pro- 
teins, A15, A30, D2, D3, G7, and J1, are found in a seven-protein 
complex along with the F10 protein kinase, which, as described previ- 
ously, phosphorylates several membrane and core proteins and plays 
a critical role throughout virus assembly. The role of each of these 
proteins in IV formation is summarized in the following sections. 


A. Seven-Protein Complex 


As membrane crescents appear, they are often seen at the periphery 
of electron-dense regions of viroplasm, which is known to contain viral 
proteins destined for encapsidation into the virion core. It is not known 
what protein-protein or protein—lipid interactions are involved in 
mediating the association of viroplasm with growing crescents: there 
are no known interactions of individual membrane proteins with core 
proteins. However, a group of seven core proteins, F10, A30, G7, J1, 
D2, D3, and A15, has been shown to form a multimeric complex, and in 
some cases the proteins are mutually dependent on one another for 
stability (Szajner et al., 2004a). Several members of the complex are 
phosphorylated in vivo in an F10-dependent manner. The stoichiome- 
try of the individual components within the complex has not been 
determined. In addition to being physically linked, these proteins show 
a genetic interaction, since comparable phenotypes are observed upon 
mutation or repression of individual members of the complex. As 
detailed in a later section, the phenotypes suggest a role for these 
proteins in the association of viroplasm with crescents. 
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1. F10 


F10 has been described previously in significant detail (Section 
VI.C.1). With regard to its involvement as a member of the seven- 
protein complex which mediates the association of viroplasm with 
growing crescents, it is worth recalling the fact that F10 has been 
shown to be tightly associated with membranes in vivo (Punjabi and 
Traktman, 2005). Direct evidence for the functional involvement of 
F10 in the association of viroplasm with crescents was obtained from 
ultrastructural analyses of tsF10 infections that were initiated at the 
permissive temperature in the presence of rifampicin, and then shifted 
to the nonpermissive in the absence of rifampicin at 12 hpi. The 
phenotype seen here was nearly identical to that observed upon dis- 
ruption of A30 or G7 expression (see later). Indeed, the phosphoryla- 
tion of A30 and G7 is dependent on F10 in vivo and also appears to be 
dependent on the assembly of the multimeric complex and the normal 
progression of morphogenesis (Mercer and Traktman, 2005; Szajner 
et al., 2004c). It seems likely that F10 mediates phosphorylation of 
these proteins in a context-dependent manner. 


2. A30 


Disruption of the A30L gene was shown to lead to a phenotype in which 
the association of nascent membranes with viroplasm was disrupted. 
This disruption led to the appearance of large virosomes and “empty” 
IV and/or pseudo-IV with multiple membrane wrappings. This pheno- 
type was observed when an inducible recombinant was generated 
permitting the repression of A30, or when a temperature-sensitive 
mutant, Dts46, was analyzed at the nonpermissive temperature (Szajner 
et al., 2001a,b). The tsA30 protein is extremely thermolabile: effectively, 
at the nonpermissive temperature, repression of the A30 protein is mim- 
icked. A30 is a 9-kDa protein that is found in viroplasm and encapsidated 
within virions; fractionation studies have demonstrated that A30 is 
loosely associated with the virion core. A30 is phosphorylated on Ser 
residues in vivo in an F10-dependent manner, and it has been shown to 
be a direct substrate of F10 in vitro. A30 has been shown to interact 
directly with the G7 protein (discussed in a later section), and these 
proteins appear to depend on one another for their mutual stability. 


3. G7 


G7 was first implicated in virion morphogenesis because of its asso- 
ciation with the A30 protein (Szajner et al., 2003). The 42-kDa G7 
protein is expressed at late times postinfection, encapsidated into 
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virion cores, and processed by proteolytic cleavage at two AG|X motifs 
during virion maturation (Fig. 13). The G7 protein is phosphorylated 
in vivo in an F10-dependent manner, but F10 does not appear to 
phosphorylate G7 in vitro, either when presented alone or in the 
presence of its binding partner A30 (Mercer and Traktman, 2005; 
Szajner et al., 2004c). The function of G7 in vivo has been addressed 
both by the generation of an inducible recombinant in which G7 ex- 
pression is IPTG-dependent and by characterization of Cts11, which 
encodes a defective G7 protein (Mercer and Traktman, 2005; Szajner 
et al., 2003). These two approaches have yielded somewhat different 
answers regarding G7’s function, as detailed below. 

During infections with Cts11, virosomes accumulate as do short 
membrane crescents that are embedded within a distinct matrix of 
medium electron density. Both the G7 and A80 proteins are stable 
during these infections, permitting the examination of the impact of 
a specific loss of G7 function. When various G7 and A30 proteins were 
synthesized in vitro and subjected to coimmunoprecipitation protocols, 
the tsG7 protein had lost the ability to interact with A30. When Cts11 





[p4a] 102 kDa [p4b] 73 kDa 
[4a] 62 kDa 9kDa 23kDa 6kDa [4b] 60 kDa 
core ? core ? core 
42 kDa 20 kDa 
G7 — Aiz — 
21 kDa 15 kDa 6kDa 15 kDa 
? 6kDa core ? core 
? 
28 kDa 23 kDa 
4kDa [vp8]25 kDa 2 kDa 2 kDa 
? core ? 21kDa ? 
membrane 


Fic 13. Proteolysis of vaccinia core and membrane proteins. The proteolysis of five 
core proteins (A10, A3, G7, A12, and L4) and one membrane protein (A17) is shown. 
Black lines represent the length of the precursor polypeptide, drawn to scale. Red arrow- 
heads represent known cleavage sites. Information concerning the precursor (MW, any 
pseudonyms in brackets, and localization if relevant) is given above each line. Informa- 
tion concerning each product (MW, pseudonyms in brackets, localization if known) is 
given below each line. ? = fate of the product is unknown. 


VACCINIA VIRUS: STRUCTURE AND ASSEMBLY 85 


infections were performed at permissive temperature in the presence 
of rifampicin for 12 h and then removed from the rifampicin block and 
shifted to the nonpermissive temperature, the phenotype observed 
mimicked what had been observed on repression of A30 (Fig. 11D). 
This phenotype, characterized by the accumulation of empty IV, is 
also observed when G7 is repressed. Interestingly, tsG7 mutants are 
noncomplementing with tsA30 mutants, providing additional genetic 
evidence for an interaction between G7 and A30 (Lackner et al., 2003). 
In summary, G7 appears to play crucial roles at two stages of morpho- 
genesis (Mercer and Traktman, 2005). First, G7 is required for the 
movement of crescents to the periphery of virosomes, where they 
enlarge; later, G7 is required for the filling of the crescents with 
viroplasmic material. The small amount of wt G7 that is likely to 
accumulate when infections with the inducible recombinant are per- 
formed in the absence of the IPTG inducer may be sufficient for the 
initial G7-dependent step, whereas the tsG7 protein must be unable to 
mediate this effect. 


4. J1 


The 21-kDa J1 protein is expressed at late times and encapsidated; like 
A30, although J1 is associated with virion cores, a fraction of the protein 
can be extracted with nonionic detergents. Purified J1 has been shown to 
form oligomers (Chiu et al., 2005), and interactions between the N- and 
C-termini of J1 have been observed in yeast two-hybrid analyses. The 
role of J1 in vivo has been analyzed using both an inducible recombinant 
in which J1 expression is IPTG-dependent and through the analysis of 
Cts45, which encodes a mutant J1 protein (Chiu and Chang, 2002; Chiu 
et al., 2005). Repression or inactivation of J1 phenocopies the repression 
of A30 or G7, and indeed both A30 and G7 become unstable in the absence 
of J1 at 40°C. J1 is part of the seven-protein complex involved in viro- 
plasm/membrane association, although the proteins with which J1 as- 
sociates directly have not been identified (Szajner et al., 2004a). The tsJ1 
protein remains stable at the nonpermissive temperature but does not 
associate with the multiprotein complex. 


5. A15 


The 10-kDa A15 protein was shown to be a component of the seven- 
protein complex by mass spectroscopy (Szajner et al., 2004a). This pro- 
tein is expressed at late times postinfection and encapsidated in the 
virion core. An inducible recombinant in which A15 expression is 
IPTG-dependent has been generated; in the absence of A15, the charac- 
teristic accumulation of empty IV was observed (Szajner et al., 2004a). 
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6. D2 


D2 was identified as being a component of the seven-protein complex 
by mass spectroscopy (Szajner et al., 2004a). The 16.9-kDa D2 protein 
had been known to be a component of the virion core (Dyster and Niles, 
1991). The temperature sensitivity of the Ets52 mutant is due to a 
mutation in D2 that renders the protein extremely thermolabile; at the 
nonpermissive temperature, few signs of morphogenesis are seen. 
Crescents and empty IV are observed only occasionally. The phenotype 
is similar to that seen for the leakier tsF10 mutants (Traktman et al., 
1995; Wang and Shuman, 1995). An inducible recombinant has been 
generated in which expression of D2 is dependent on IPTG. Even in 
the presence of IPTG, the virus forms small plaques, suggesting 
that the epitope tag appended to D2 is deleterious. Nevertheless, 
repression of D2 does severely impair virion maturation, leading to 
an accumulation of empty IV (Szajner et al., 2004a). 


7. D3 


The 27-kDa D3 protein is also a component of the virion core and 
seven-protein complex (Dyster and Niles, 1991; Szajner et al., 2004a). 
The phenotype of Cts35, which encodes a defective D3 protein that is 
extremely thermolabile, is virtually indistinguishable from that seen 
for tsD2. Consistent with their presence in the seven-protein complex, 
stability of the D2 and D3 proteins is mutually interdependent: both 
D3 and D2 are destabilized during tsD3 and tsD2 infections. 


B. A10: p4a/4a 


The AJOL gene encodes the virion precursor protein p4a (Wittek et al., 
1984). During morphogenesis from IV to MV, p4a is processed by two 
proteolytic cleavages into an N-terminal 62-kDa polypeptide, an inter- 
nal 9-kDa polypeptide, and a C-terminal 23-kDa polypeptide (Fig. 13) 
(Vanslyke et al., 1991b). Both the 62- and the 23-kDa polypeptides are 
packaged into virions; the fate of the 9-kDa polypeptide is unknown 
(Vanslyke and Hruby, 1994; Vanslyke et al., 1991b). The 62-kDa poly- 
peptide is also commonly called 4a. 4a localizes to the virion core (Sarov 
and Joklik, 1972); Ichihashi et al. (1984) suggest that in fact it is a 
component of the outer core wall. 4a comprises an impressive 14% of 
the mass of the mature virion, or 28% of the mass of the virion core 
(Sarov and Joklik, 1972). Thus, p4a and its proteolytic products are 
major determinants of vaccinia virion structure. 

A10 function has been examined using an inducible recombi- 
nant in which A10 expression is IPTG-dependent. In the absence of 


VACCINIA VIRUS: STRUCTURE AND ASSEMBLY 87 


inducer, normal crescents are seen, but the IV that form are aberrant 
(Heljasvaara et al., 2001). Many of the abnormal IV particles appear 
empty, but others contain dense patches of DNA-containing material. 
DNA aggregates also accumulate in the cytoplasm. The membranes 
surrounding the IV are normal in appearance and contain the A14 and 
A17 membrane proteins as well as the D13 scaffold protein, as judged 
by immunoelectron microscopy. Thus, A10 appears to be dispensable 
for the formation of the IV membrane but is required for the proper 
organization and packaging of the viroplasmic matrix that is normally 
included in IV. The phenotype observed on repression of A10 mimics to 
a great degree the phenotype of mutants affected in components of the 
seven-protein complex described previously. 

The A10 precursor and the 60-kDa cleavage product form stable 
complexes with another abundant virion core protein, A4 (Risco 
et al., 1999). The implications of this interaction are discussed in 
Section IX.B.3 along with the role of A4 in MV formation. 


VIII. IV—IVN TRANSITION: GENOME ENCAPSIDATION 


The process of viral DNA replication produces progeny genomes that 
will be encapsidated into nascent virions. Concatemeric intermediates 
are formed during the replication process, and these are processed to 
monomeric genomes by a virally encoded resolvase (Garcia et al., 2000) 
(Section VIII.B.3). Genome maturation occurs normally when virion 
morphogenesis is blocked in the very earliest stages of morphogenesis, 
for example, in H5, G5, and J1 mutant infections (Table II), indicating 
that this maturation is a prerequisite for, but is not coupled to, genome 
encapsidation (Chiu and Chang, 2002; da Fonseca et al., 2004; DeMasi 
and Traktman, 2000). Much remains to be discovered about the en- 
cpasidation of the viral genome, although it appears to be distinct from 
the process described previously for the filling of maturing crescents 
with viroplasmic proteins destined to form the virion core. One of the 
unresolved questions is whether genome entry precedes closure of 
the IV membrane. Serial section analysis shows that most fully closed 
IV do indeed have nucleoids (Morgan et al., 1955), suggesting that 
genome encapsidation is likely to occur prior to the closure of the IV 
(Morgan, 1976a,b). If this model is not correct, then the encapsidation 
machinery must either pass through a proteinaceous plug that has 
been proposed to mark the site of IV membrane closure, or it must pass 
through the IV membrane itself. Some investigators have proposed 
that genome entry causes membrane invagination, leading to the 
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presence of a membrane around the nucleoid and, consequently, the 
core. However, this model is not widely accepted (Section VI.A) (Fig. 8). 

It is not known exactly how replicated genomes are distributed 
throughout the cytoplasm, although sites of replication have been 
visualized by autoradiography or with antisera directed against BrdU, 
DNA, or the single-stranded binding protein, I3. When morphogenesis 
is arrested prior to the appearance of IVN, DNA crystalloids accumu- 
late in the cytoplasm (Ericsson et al., 1995; Grimley et al., 1970; 
Grubisha and Traktman, 2003; Unger and Traktman, 2004). These 
crystalloids are often surrounded by membranes, consistent with the 
proposal that DNA replication occurs within ER-delimited domains 
of the cytoplasm (Schramm and Locker, 2005; Tolonen et al., 2001) 
(Section V) (Fig. 5). 

The literature contains a number of ultrastructural images that have 
been interpreted as representing the ongoing process of DNA encapsi- 
dation (Morgan, 1976a,b). Although this process is commonly referred 
to as encapsidation of the viral “nucleoprotein,” there is in fact nothing 
known about how the genome is condensed and whether it is indeed 
bound to proteins. As described later, when either of the two proteins 
that have been specifically associated with genome encapsidation are 
repressed or inactivated, no other proteins appear to be missing 
from the virions (Cassetti et al., 1998; Grubisha and Traktman, 2003). 
These data do not support a model in which the genome is assembled 
into a conventional nucleoprotein complex. In fact, it may be that 
condensation of the genome is effected by interactions with polyamines; 
encapsidation of spermidine has been reported (Lanzer and Holowczak, 
1975; Saminathan et al., 2002). In any case, the genome has a predicted 
dimension of 2 nm x 68 pm, which must fit into the viral core. The 
approximate dimensions of the nucleoid, as observed in ultra thin sec- 
tions, are 0.1 x 0.15 um, indicating that the genome must undergo 
massive condensation before encapsidation. 

To date, four viral proteins have been implicated in maturation and 
processing of the viral genome. These include the core proteins A22 
(a Holliday resolvase), I6 and A32, and the membrane protein A13. 


A. Membrane Proteins 
1. A13 


The 70-amino acid A13 protein has a hydrophobic N-terminus, 
which is predicted to be the membrane-spanning domain; indeed 
A13 undergoes cotranslational insertion into microsomal membranes 
in vitro (Salmons et al., 1997) and is a component of the virion 
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membrane (Fig. 12). The hydrophilic tail of A13 undergoes phosphory- 
lation in vivo on Ser residues; mutation of Ser40—Ala prevents phos- 
phorylation (Unger and Traktman, 2004). This modification appears to 
be independent of the F10 kinase; Ser40 is found within the motif 
NSPPP, which is predicted to be a high-affinity site for the cellular 
kinase ERK. The function of A13 has been elucidated both by the 
analysis of a TET-inducible recombinant and a temperature-sensitive 
mutant (Cts40) (Unger and Traktman, 2004); the A13 protein in Cts40 
has a Thr48—Ile substitution and is thermostable but apparently 
inactive. When A13 is inactivated, a morphogenesis arrest is seen; IV 
are formed, but IVN are rare and MV are not seen (Fig. 11E). Numer- 
ous DNA crystalloids, which appear to be surrounded by membranes, 
accumulate in the cytoplasm. Thus, A13 appears to be essential for the 
encapsidation of DNA into IV and the subsequent maturation of 
IV—-IVN—MV. 


B. Core Proteins 
1. A32 


Bioinformatic analysis was first used to identify the A32 protein as a 
putative ATPase with limited homology to the encapsidation proteins 
encoded by bacteriophages and adenovirus (Koonin et al., 1993). The 
34-kDa protein is expressed at late times postinfection and is encapsi- 
dated; to date, there has been no experimental data confirming that 
the protein indeed possesses the predicted ATPase activity. An induc- 
ible recombinant in which expression of A32 is IPTG-dependent has 
been generated; in the absence of A32, the production of infectious 
virus is severely diminished (Cassetti et al., 1998). Morphogenesis 
appears normal through the IV stage, but then aberrant forms are 
seen. Spherical virions that appear dense or half-dense/half-empty are 
numerous, as are cytoplasmic DNA crystalloids. The aberrant virions 
have been purified and shown to contain the full-protein complement 
but to be devoid of viral DNA. 


2. I6 


The 44-kDa I6 protein was identified as binding with great specifici- 
ty and stability to the telomeric hairpins of the viral genome (Demasi 
et al., 2001b). The telomere—I6 interaction was shown to be dependent 
on the presence of the extrahelical bases, which are so characteristic of 
the viral telomeres. Very low levels of I6 are made at early times 
postinfection; the majority of the protein is synthesized after the onset 
of DNA replication. An average of ~35 copies of I6 is encapsidated in the 
virion core. A series of mutants with lesions in the J6L gene have been 
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generated by clustered charge-to-alanine mutagenesis, and one of these 
has a tight temperature-sensitive phenotype (Grubisha and Traktman, 
2003). At the nonpermissive temperature, the phenotype observed is 
almost indistinguishable from that seen on repression of A32: IV appear 
normal, cytoplasmic DNA crystalloids are abundant, and aberrant, 
spherical virions are produced that contain the full-protein complement 
but no viral DNA (Fig. 11F). A32 remains stable during these infections 
and is encapsidated into the aberrant virions, indicating that its 
encapsidation is independent of DNA encapsidation and I6 function. 
Conversely, no I6 is encapsidated into A32-deficient virions. 

These cumulative data support a model in which the presence of 
A32 within virions enables the encapsidation of a DNA genome on 
which I6 protein is bound to the telomeric hairpins. As described 
previously, repression of certain membrane proteins, such as A13, 
leads to the accumulation of IV and DNA crystalloids; it is possible 
that the presence of these proteins in the membrane is also important 
for A32 and/or I6-DNA encapsidation. 


3. A22, a Holliday Resolvase 


As described previously, concatemeric replication intermediates are 
processed to mature genomes by a virally encoded Holliday junction 
resolvase, the A22 protein (Garcia et al., 2000). The product of the A22R 
gene is a palmityl protein that is expressed late during virus infection 
and packaged into the virion core (Garcia and Moss, 2001; Grosenbach 
et al., 2000). Analysis of an inducible mutant demonstrates that in the 
absence of A22, concatemer resolution is defective, and virion morpho- 
genesis is arrested at an intermediate stage (Garcia and Moss, 2001). In 
these A22-deficient infections, MV formation is inhibited and IV accu- 
mulate, a very few of which have nucleoids. In addition, there is an 
accumulation of dense spherical particles similar to those seen in other 
encapsidation defective mutants. Thus, while genome maturation can 
proceed in the absence of virion morphogenesis, the converse is not true. 
If concatemer resolution is inhibited, virion morphogenesis is arrested 
and the nature of the intermediate particles that accumulate suggests 
that the defect lies in DNA packaging. 


IX. MV FORMATION 


Perhaps the most striking event during morphogenesis of vaccinia is 
the transition from IVN to MV. The precise protein composition of 
immature particles is difficult to assess because normal IV and IVN 
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are difficult to isolate in sufficient quantity or purity to permit rigorous 
biochemical study (Vanslyke et al., 1993). Nevertheless, as detailed 
later, a synthesis of the evidence to date suggests that IVN contain 
the full complement of proteins contained in MV, with the exception 
of a few membrane proteins that are added during the IV/IVN to MV 
transition. Furthermore, since both the IV membrane and the 
MV membrane contain the major viral membrane proteins A14 and 
A17, the membrane of IVN is the precursor for the MV membrane 
(Griffiths et al., 2001a; Rodriguez et al., 1997; Traktman et al., 2000). 
Thus the IV/IVN to MV transition involves a rearrangement of the 
contents of IVN to form a distinct core and lateral bodies, and in the 
process the particle is transformed from a spherical to a barrel shape. 
Indicative of the complexity of this process, a significant majority of 
virus mutants that affect virion morphogenesis affect the IV/IVN to 
MV transition. These mutants fall into two general classes: those that 
fail to assemble morphologically normal looking MV particles and those 
that assemble normal looking MV particles, which are nevertheless not 
infectious. Analysis of these mutants informs us that the IV/IVN to MV 
transition involves or requires: (1) assembly of the transcription appa- 
ratus within the core, (2) restructuring of the particle surface, including 
loss of the D13 protein, formation of numerous disulfide bond linkages 
by the virally encoded redox system and addition of several additional 
surface proteins, (3) proteolysis of several virion protein precursors and 
formation of an enzymatically active core, in part through the action of 
the viral phosphatase, and (4) movement of the finished MV from 
viroplasm to the periphery of the cell. Each of these events is discussed 
in more detail in later sections. 


A. Assembly of the Transcription Apparatus Within the Core 


The phenotype of virus mutants affecting two different virus coded 
transcription factors, VETF and rap94, reveals an intriguing relation- 
ship between assembly of the early transcription machinery within the 
virion core and formation of MV. Vaccinia RNA polymerase exists in 
two different forms, one that contains the subunit rap94 (the product 
of the H4L gene), and one that lacks rap94 (Broyles, 2003). The rap94- 
containing enzyme transcribes early viral genes exclusively, while 
RNA polymerase lacking rap94 transcribes exclusively intermediate 
and late genes. VETF, composed of the D6 and A7 proteins, binds 
specifically to early virus promoters and is required for early gene 
transcription in vitro (Broyles, 2003). Thus, specificity of the RNA 
polymerase for early promoters is thought to result from a specific 
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interaction between rap94 and VETF, although such an association 
has never been proven directly. 


1. D6 and A7: VETF 


Genetic analysis of VETF is facilitated by the existence of inducible 
recombinants and temperature-sensitive mutants. There exist multi- 
ple temperature-sensitive mutants with lesions in the D6R gene, and 
inducible recombinants have been generated which allow IPTG- 
dependent expression of both subunits of VETF, D6 and A7 (Christen 
et al., 1992; Hu et al., 1996, 1998). Ultrastructural analysis of cells 
infected with the temperature-sensitive mutants has not been re- 
ported, but such studies have been published for the inducible recom- 
binants. When either VETF subunit is repressed, a “normal” gene 
expression phenotype is observed; that is, all three classes of viral 
proteins, early, intermediate, and late, are synthesized in normal 
amounts and with normal kinetics.”, However, electron microscopic 
analysis of the mutant infections reveals a block in morphogenesis at 
the IV/IVN stage. MV are either absent (D6) or significantly reduced in 
number (A7) in the mutant infections. Instead, an abundance of mor- 
phologically normal IV and IVN accumulate and occasional abnormal, 
dense, spherical particles are seen in viral factories. Confocal micros- 
copy of A7-deficient infections stained for DNA and a virion membrane 
protein suggests that at least a fraction of the immature particles 


5 A general and seemingly peculiar feature of mutants in vaccinia genes affecting 
virion transcription enzymes is that they confer a phenotype that is expressed late rather 
than early during infection. Offhand, one might expect a mutation in a virion transcrip- 
tion enzyme, in particular a temperature-sensitive mutant, to affect early viral tran- 
scription and thus disrupt the infection at an early stage. However, all mutants in 
transcription enzymes reported to date proceed through the early phase of infection 
normally and display a phenotype that impacts the late phase of infection. In the case 
of inducible mutants, this phenomenon is not difficult to understand. The inducible 
mutant must be grown under permissive conditions in order to study the phenotype 
under nonpermissive conditions. During growth under permissive conditions, the wild 
type gene product is synthesized and packaged into virions, which are therefore compe- 
tent for early transcription under nonpermissive conditions. The nonpermissive infection 
therefore only tests for any requirement for the protein during the later phases of 
infection. In the case of temperature-sensitive mutations the same phenomenon is 
observed; however, in this case the virions synthesized under permissive conditions 
package mutant enzyme. One must therefore argue that the mutant proteins are not 
thermolabile in virions, but rather that they are temperature sensitive for synthesis, that 
is, newly synthesized mutant protein is either rapidly degraded or cannot form an active 
conformation at the nonpermissive temperature, and thus the phenotype observed 
represents the requirement for newly synthesized enzyme late during infection 
(Hooda-Dhingra et al., 1989). 
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contain DNA. Both mutants also accumulate abnormally large 
masses of viroplasm within viral factories. The protein composition of 
the accumulated IV and IVN is unknown. Thus in the absence of 
VETF, virus assembly arrests at the IV/INV stage, implying a pivotal 
role for VETF in formation of the viral core and transition from IVN 
to MV. 


2. H4: rap94 


Genetic analysis of the H4L gene, which encodes rap94, is also en- 
abled by the existence of both inducible and temperature-sensitive 
mutants affected in this gene (Kane and Shuman, 1992; Zhang et al., 
1994). As was described for the VETF mutants, inactivation or repres- 
sion of rap94 does not impair the profile of viral gene expression. Unlike 
the VETF mutants however, both the temperature sensitive and the 
inducible mutants in rap94 assemble particles that are morphologically 
indistinguishable from wt MV as judged by electron microscopy. These 
MV made under nonpermissive conditions are deficient in their ability 
to direct in situ transcription in vitro. Moreover, they lack not only 
rap94 as expected but also specifically lack all of the major known or 
suspected enzymes involved in early viral transcription, including the 
viral RNA polymerase, the capping enzyme/termination factor, the 
poly(A) polymerase large subunit, DNA-dependent ATPase I, RNA heli- 
case, and topoisomerase. These abnormal MV contain normal amounts 
of VETF. These results are interpreted to mean that a presumed rap94— 
VETF interaction seeds assembly of a transcription complex onto the 
viral genome and thus coordinates packaging of the transcription ma- 
chinery during virus assembly. Notably, the result also implies that the 
morphological transformation from IV to MV, including formation of the 
core and lateral bodies, occurs independently of the assembly of the 
transcription apparatus within the core. 

Together, studies with mutants in VETF and rap94 suggest that the 
early viral transcription factor VETF, perhaps in conjunction with or 
through interaction with other DNA binding or structural proteins, 
plays a critical role in the morphological transition from IVN to MV, 
and also, perhaps through an interaction with rap94, plays a critical 
role in recruitment and assembly of a transcription complex in the 
developing virion. The recruitment and assembly of the transcription 
complex itself, however, is not a prerequisite for virion maturation, in 
that normal appearing MV can be assembled in the absence of the 
transcription complex. Finally, it is worth recalling that encapsidation 
of VETF and the early transcriptional machinery does not appear to 
require their association with the viral genome, since DNA-deficient 
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particles assembled in the absence of the A32 protein do contain the 
early transcriptional machinery (Cassetti et al., 1998). 


B. Core Proteins Involved in the IV to MV Transition 


A number of core proteins are required for virion maturation, as 
revealed by the phenotypes exhibited by temperature-sensitive mu- 
tants that encode mutant forms of these proteins, or inducible recom- 
binants which allow the expression of the proteins of interest to be 
experimentally repressed. In some cases, repression or inactivation of 
the protein simply arrests morphogenesis at the IV or IVN stage; in 
other cases, aberrant particles are seen. Since not all publications are 
equally thorough in their ultrastructural analysis, implication of a 
given protein in a precise step of virion assembly is not always possi- 
ble. It would appear that when the D13 scaffold remains intact, 
there is an accumulation of IV and/or IVN. In contrast, when the D13 
scaffold is removed, then further virion maturation is attempted 
unsuccessfully, leading to the production of aberrant particles. 


1. [1 


The 36-kDa I1 protein is expressed at late times postinfection and 
encapsidated into virion cores (~700 copies/virion) (Klemperer et al., 
1997). The protein has nonspecific DNA binding activity but shows some 
preference for binding to the viral telomeres (Demasi et al., 2001a; Klem- 
perer et al., 1997). An inducible recombinant in which I1 expression is 
IPTG-dependent has been generated; upon the repression of I1, morpho- 
genesis arrests with the accumulation of numerous IV. IVN were also 
seen, although some DNA crystalloids were observed in the cytoplasm 
(Klemperer et al., 1997). The I1 protein is highly conserved among 
poxviral genomes, in keeping with its essentiality for virion maturation. 


2. F17 


The 11-kDa F17 protein is expressed at late times postinfection and 
is one of the major components of the core; ~27,000 copies of the 
protein are encapsidated per virion. This protein was initially referred 
to as p11 or VP11 (Kao and Bauer, 1987; Kao et al., 1981; Nowakowski 
et al., 1978a,b). The protein is phosphorylated in vivo on Ser residues; 
phosphorylation is only minimally dependent on the F10 kinase but is 
modulated by expression of the viral H1 phosphatase. It is likely that 
the protein is phosphorylated on two PSSP motifs which are target 
sites for cellular proline-directed kinases such as ERK1 and CDK1 
(Reddy, unpublished; Traktman, unpublished). The F17 protein is 
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quite basic and was originally described as a DNA-binding protein that 
could coat DNA stoichiometrically. However, normal amounts of F17 
are encapsidated even when the genomic DNA is not, suggesting that 
F17 is certainly not a classical “nucleoprotein” in the sense that was 
originally proposed (Cassetti et al., 1998; Grubisha and Traktman, 
2003). An inducible recombinant has been generated in which ex- 
pression of the F17R gene is IPTG-dependent; in the absence of F17, 
morphogenesis arrests with the appearance of aberrant IV (Fig. 11G) 
(Zhang and Moss, 1991a,b). In some of these particles, there is an 
internal nucleoid that resembles the DNA crystalloids described 
earlier. In other cases, the IV appear almost empty or contain small 
patches of condensed material and/or membrane. Because DNA crys- 
talloids do not accumulate in the cytoplasm, however (Traktman, un- 
published), it is likely that the absence of F17 does not impair genome 
encapsidation per se but impairs nucleoid formation and disrupts the 
internal organization of the IV. These aberrant IV do not form a light 
scattering band upon sucrose gradient ultracentrifugation, indicating 
that they may be unstable. 


3. A4 


The A4L gene encodes a 39-kDa protein that is expressed late and 
encapsidated into the virion. This protein was first identified as an 
immunodominant antigen and was referred to as p39 (Cudmore et al., 
1996; Demkowicz et al., 1992; Maa and Esteban, 1987; Roos et al., 
1996). A4 appears to be phosphorylated in vivo and is a substrate for 
the F10 kinase in vitro; A4 phosphorylation appears to be modulated 
by the expression of the viral H1 phosphatase (Traktman, unpub- 
lished). Immunoelectron microscopy has revealed that antibodies to 
A4 decorate the interior of IV but not the nucleoid, and within MV, 
appear to localize to a region between the core and membrane. On 
extraction of purified MV with neutral detergent and DTT, A4 parti- 
tions into both the core (insoluble) and the membrane (soluble) frac- 
tions. Consistent with these results, anti-A4 stains the surface of 
isolated cores, leading Roos et al. (1996) and Cudmore et al. (1996) to 
conclude that A4 comprises, at least in part, the “spike” or palisade 
layer of the outer core wall. The A4 protein forms a complex with both 
the A10 precursor protein, p4a, and the 62-kDa proteolysis product, 4a 
(Risco et al., 1999). Nevertheless, inducible A10 mutants present a 
significantly different phenotype than A4 mutants. A10 mutants are 
defective in the association of viroplasm with crescents that is required 
to form IV (Section VII.B). In contrast, A4 mutants are defective in 
formation of MV but form both normal IV and IVN; some abnormal 
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structures consisting of IV which appear to be surrounded by an addi- 
tional membrane are also seen (Williams et al., 1999). In this case the 
authors were able to purify virions from the A4-deficient infections. The 
mutant particles were seemingly normal in protein content, although 
they contained slightly more unprocessed A10 protein, consistent with 
an assembly block at the IV/IVN to MV transition. Thus, A4 appears to 
be necessary for core assembly during the IV to MV transition. 


4. A3 


The A3L gene encodes the p4b/4b protein, a major component of the 
virion core that undergoes proteolytic processing during the IV to MV 
transition. The 73-kDa precursor is cleaved to a mature protein of 60 
kDa, called 4b (Fig. 13), which accounts for 11% of the dry mass of the 
virion (Sarov and Joklik, 1972). Several independent studies localize 
4b to the core wall (Ichihashi et al., 1984; Sarov and Joklik, 1972; 
Wilton et al., 1995). Analysis of temperature-sensitive mutants that 
encode defective A3 proteins (Cts8, Cts26) has revealed that A3 is 
essential for the formation of a structurally normal core (Kato et al., 
2004). Under nonpermissive conditions, tsA3 mutants form normal IV 
and IVN but do not form normal MV. Instead, spherical or irregularly 
shaped particles are formed which accumulate at a distance from viral 
factories and can be wrapped in Golgi-derived membranes and released 
by exocytosis, as if they were normal MV (Fig. 111). The aberrant tsA3 
particles can be purified for analysis; they contain the normal comple- 
ment of virion proteins and DNA; however, their internal structures are 
malformed relative to wild-type MV. In the tsA3 mutants, mutant 4b 
protein is made, processed, and packaged in normal amounts in aber- 
rant virions, suggesting that aberrant particles observed in tsA3 mu- 
tants result from the inability of the mutant 4b protein to function in 
formation of a normal core wall. Importantly, the aberrant A3 mutant 
particles are defective in virion transcription despite containing viral 
DNA and an active transcriptional apparatus, demonstrating that 
proper core formation is required for transcription in situ. 


C. Proteolysis of Virion Protein Precursors; Involvement of Genes 
I7L and G1L 


An important role for proteolytic processing in virion maturation is a 
common theme in numerous systems, and vaccinia is no exception 
(Hellen and Wimmer, 1992). Early studies revealed that the major 
structural protein 4a, the product of the AJOL gene, is derived from 
the p4a precursor by proteolytic processing during the late stages of 
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virion assembly (Katz and Moss, 1970a,b). Subsequent studies have 
shown that in addition to A10 (p4a/4a), the virion core proteins A3 
(p4b/4b), L4 (VP8), A12, and G7, and the membrane protein A17 
are also derived via proteolytic processing of precursor polypeptides 
(Betakova et al., 1999b; Mercer and Traktman, 2005; Rodriguez et al., 
1997; Rosel and Moss, 1985; Sarov and Joklik, 1972; Szajner 
et al., 2003; Takahashi et al., 1994; Vanslyke et al., 1991a,b; Weir and 
Moss, 1985; Whitehead and Hruby, 1994b; Yang et al., 1988). The 
processing of each of the proteolyzed virion proteins is understood in 
significant detail, as summarized in Fig. 13. 

Alignment of all of the known cleavage sites for proteolytically pro- 
cessed vaccinia proteins yields the consensus sequence AG|X, where 
cleavage occurs between the G and the X (Vanslyke et al., 1991b). 
A survey of the vaccinia genome reveals 82 potential AG|X cleavage 
sites, although it has been shown experimentally that not all of these 
sites are cleaved (Whitehead and Hruby, 1994b). Nevertheless, given 
the abundance of potential cleavage sites, it seems likely that addition- 
al targets for morphogenesis-associated proteolysis will be discovered. 

Vaccinia contains two genes, IZL and GIL, which encode proteins 
with significant homology to known proteases. The G1 protein, a com- 
ponent of the virion core, contains the motif HXXEH that is common to 
a subset of metalloproteases (Ansarah-Sobrinho and Moss, 2004a; 
Hedengren-Olcott et al., 2004; Whitehead and Hruby, 1994a). In tran- 
sient infection—transfection assays, the G1 gene product can cleave the 
L4 precursor protein in a manner that depends on an intact HXXEH 
motif. Gl-dependent cleavage of L4 does occur at an AG|X motif, 
although cleavage occurs at a cryptic AG|S site rather that the natural 
AG|A site (Whitehead and Hruby, 1994a). An inducible recombinant in 
which expression of G1 can be experimentally regulated has been 
constructed; repression of G1 blocks the production of infectious virus 
at the IV to MV transition (see later) (Ansarah-Sobrinho and Moss, 
2004a). This phenotype cannot be complemented by expression of a 
GIL allele containing mutations in the HXXEH motif, implying that 
the predicted metalloprotease activity of G1 is essential for virus 
growth. However, although normal MV do not form when G1 is re- 
pressed, cleavage of the A3, A10, L4, G7, and A17 precursors occurs 
normally, showing that the G1 protein is not required for their proteo- 
lytic maturation. Thus, although G1 is clearly required for virion 
maturation, its role as a protease remains unresolved. 

The I7 protein, which is also a component of the virion core, has 
homology to known cysteine proteinases encoded by African swine 
fever virus and yeast (Andres et al., 2001; Kane and Shuman, 1993; 
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Li and Hochstrasser, 1999). Nonpermissive infections performed with 
temperature-sensitive mutants or an inducible recombinant affected 
in the I7L gene are characterized by defective processing of the A3, 
A10, L4, and A17 precursors. Neither this proteolysis defect nor the 
accompanying morphogenesis arrest at the IV to MV transition can be 
complemented by mutants containing alterations in residues predicted 
to comprise the active site of the I7 protease (Ansarah-Sobrinho and 
Moss, 2004b; Byrd et al., 2002, 2003; Ericsson et al., 1995). Direct 
biochemical evidence that purified I7 protein can cleave appropriate 
substrates in vitro is still lacking, and it is therefore formally possible 
that the impact of repression or inactivation of I7 on proteolysis in vivo 
is indirect. Nevertheless, it seems highly probable that the I7 cysteine 
protease is directly responsible for processing most or all of the viral 
proteins known to undergo morphogenesis-associated proteolysis. 
Proteolysis of the virion core proteins, but not the A17 membrane 
protein, is coupled to and probably required for maturation of IV/IVN to 
MV. Rifampicin treatment prevents proteolysis of core proteins but not 
the A17 membrane protein, suggesting that membrane protein matu- 
ration is an early event in virus assembly while core protein proteolysis 
is a late event (Betakova et al., 1999b; Katz and Moss, 1970a; Rodriguez 
et al., 1993). Inspection of Table II reveals that where tested, all mu- 
tants that are blocked in the IV/IVN to MV transition are also blocked in 
core protein proteolysis, with two notable exceptions, namely G1 mu- 
tants and A3 mutants, discussed later. Conversely, all mutants that 
accumulate morphologically normal MV show normal proteolysis of 
core proteins. Thus, there exists a correlation between proteolysis and 
the IV/IVN to MV transition. The core protein precursors are almost 
certainly contained within completed IVN prior to the conversion to MV. 
Because IV and IVN appear to be unstable in the standard hypotonic 
buffers used for virus purifications, the precise protein composition of 
immature virions is difficult to assess (Hu et al., 1996; Klemperer et al., 
1997; Ravanello and Hruby, 1994b; Vanslyke et al., 1993). Nevertheless, 
the presence of the A3, A10, L4, and G7 proteins, or more likely their 
precursors, within IV/IVN has been demonstrated by immunoelectron 
microscopy (Szajner et al., 2003; Vanslyke and Hruby, 1994). Important- 
ly, mutation of the I7 protease not only prevents proteolysis of core 
proteins but also prevents formation of MV (Ansarah-Sobrinho and 
Moss, 2004b; Byrd and Hruby, 2005; Ericsson et al., 1995; Kane and 
Shuman, 1993) (Fig. 11H). Instead, IV and IVN accumulate, and in 
addition dense, spherical aberrant particles (not visible in Fig. 11H) 
are formed (Ansarah-Sobrinho and Moss, 2004b; Ericsson et al., 1995). 
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The aberrant particles contain viral DNA and are identical to MV in 
protein composition, with the exception that the normally processed 
core proteins remain in their precursor form. The aberrant I7 particles 
also become wrapped to form aberrant WV and EV, suggesting that the 
membrane has been properly restructured (Section IX.E). In sum, the 
data indicate that proteolysis of one or more core proteins is required for 
morphogenesis from IVN to MV, although the presence of the I7 protein 
itself may also be required for the internal restructuring of the virion. 
The phenotype of I7 mutants also suggests that core formation and 
restructuring of the particle surface can occur independently of each 
other. 

The phenotypes of A3 mutants (described previously in Section 
IX.B.4) and G1 mutants provide interesting exceptions to the corre- 
lation between proteolysis of core proteins and the IV/IVN to MV tran- 
sition. These mutants display a phenotype that is virtually identical to 
the phenotype of mutants affected in the I7 cysteine proteinase: accu- 
mulation of IV and IVN, inhibition of MV formation, and accumulation 
of dense, spherical, or abnormally shaped particles that are indistin- 
guishable from MV in protein and DNA content and are wrapped 
to form aberrant WV and released by exocytosis to form aberrant EV 
(Fig. 11I) (Ansarah-Sobrinho and Moss, 2004a; Hedengren-Olcott et al., 
2004; Kato et al., 2004). However, the core proteins contained in the 
aberrant virions that form when A3 or G1 are mutated have been 
proteolytically processed to the mature forms. In the case of the A3 
mutant, the mutant phenotype likely results directly from an inability 
of the mutant A3 protein to form a good core wall (Section IX.B.4). The 
similarity between the I7 and A3 mutant phenotypes therefore suggests 
that failure to process the core proteins has a significant impact on 
formation of the core wall. The similarity of the G1 metalloprotease 
mutant phenotype to the I7 cysteine protease mutant phenotype is 
provocative, in that it suggests that the G1 protein does indeed act as 
a protease during infection, targeting substrates whose processing is 
also critical for core formation and the IV/IVN to MV transition. 


D. Membrane Proteins Involved in the IV to MV Transition 


There are a number of additional membrane proteins that are 
dispensable for formation of the IV membrane but play key roles in 
the IV-MV transition. How and when these proteins are added to the 
membrane, and how they assist in genome encapsidation and/or core 
formation, is a question of significant interest. 
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1. A9 


The 12-kDa A9 protein is expressed at late times of infection and 
incorporated into the virion membrane (Yeh et al., 2000). It is oriented 
with its C-terminus exposed on the surface of the virion and it appears 
to be present in both IV and MV (Fig. 12). The predicted trans- 
membrane domain is located in the middle of the protein; although 
the N-terminus of the protein contains a predicted signal sequence, 
cotranslational insertion of A9 into membranes has not been docu- 
mented. An inducible recombinant in which A9 expression is IPTG- 
dependent has been generated, and A9 has been shown to be essential 
for virion production. In the absence of A9, IV are formed, as well as 
aberrant IV that appear to be “empty” or only partially filled with 
viroplasmic material (Yeh et al., 2000). 


2. L1 


The L1 protein is a myristylated component of the virion membrane; 
a C-terminal hydrophobic domain is responsible for anchoring the 
protein in the membrane, although the timing and mode of insertion 
are not known (Fig. 12) (Franke et al., 1990; Ravanello and Hruby, 
1994a). L1 was recognized early on as containing immunodominant 
epitopes, and antisera to the L1 protein have potent neutralizing 
activity (Wolffe et al., 1995). The ectodomain of the L1 protein contains 
three intramolecular disulfide bonds that are formed by the vaccinia- 
encoded cytoplasmic thiol oxyreductase (Section IX.E.2) (Senkevich 
et al., 2002a); the crystal structure of the ectodomain of L1 has been 
solved (Su eż al., 2005). An inducible recombinant in which expression 
of the LIR gene is IPTG-dependent has been generated (Ravanello and 
Hruby, 1994b). L1 is essential for virion production; in the absence of 
L1, crescents are numerous and IV do form, but maturation of IV to 
MV does not occur. This defect can be complemented in trans by 
expression of wt L1, but expression of a derivative lacking the target 
site for myristylation, Gly2, fails to complement. Likewise, comple- 
mentation is lost when the Cys residues implicated in disulfide bond 
formation are altered (Blouch et al., 2005). 


3. H3 


The H3L gene encodes an immunodominant protein which is ex- 
posed on the surface of MV (Chertov et al., 1991; Zinoviev et al., 
1994). Expressed at late times postinfection, the H3 protein has a 
C-terminal hydrophobic tail which mediates posttranslational inser- 
tion of H3 into membranes; immunoelectron microscopy suggests that 
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H3 is not present in membrane crescents but associates with virion 
membranes during IV maturation (Fig. 12) (Section IX.E) (da Fonseca 
et al., 2000a). Antisera generated against H3 have been shown to 
neutralize viral infectivity (Lin et al., 2000). A soluble form of H3 binds 
to heparin sulfate on the cell surface, and indeed this protein competes 
with virions when present during adsorption (Lin et al., 2000). Togeth- 
er, these data support the conclusion that H3 may mediate initial, 
low-affinity interactions between MV and the cell surface. 

The H3L gene is not essential in tissue culture, since deletion mu- 
tants in which H3L has been insertionally inactivated are viable (da 
Fonseca et al., 2000b; Lin et al., 2000). To aid in the analysis of the 
function of H3, an inducible recombinant in which H3 expression is 
IPTG-dependent has also been constructed (da Fonseca et al., 2000b). 
Deletion or repression of H3 causes a distinct phenotype: plaque size is 
diminished, and viral yields are reduced by approximately 10-fold. 
Electron microscopic analysis revealed that, in the absence of H3, 
production of MV is reduced, and virosomes, crescents, and IV accu- 
mulate to higher than normal levels. The absence of H3 does not 
prevent the incorporation of L1 (see in an earlier section) or D8 (an- 
other membrane protein associated with binding to chondroitin sul- 
fate) into virion membranes. In sum, H3 appears to enhance the 
efficiency of the IV-MV transition, as well as mediating interactions 
between virions and heparin sulfate moieties found on the surface of 
target cells. 


E. Restructuring the Particle Surface 
1. Loss of D18 


As described previously, IV and IVN are coated on their outer surface 
with a “spicule layer” composed of a honeycomb lattice of the D13 
protein (Section VI.E) (Dales and Mosbach, 1968; Szajner et al., 
2005). Electron microscopic observations led Dales and coworkers to 
conclude that during the transition from IVN to MV, the spicule layer 
was lost and replaced by a morphologically distinct structure, “surface 
tubule elements” (Dales and Pogo, 1981). Although low levels of 
the D13 protein have been found within purified MV by immunoblot 
analysis and mass spectroscopy (Chung et al., 2006; Miner and Hruby, 
1989), this association is probably fortuitous (Franke and Hruby, 1987); 
immunoelectron microscopy reveals that anti-D13 antibodies decorate 
IV but not MV within infected cells (Mohandas and Dales, 1995; 
Sodeik et al., 1994). It is now accepted that, during the transition from 
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IV to MV, the D13 spicules are indeed lost from the maturing virus 
particles. Understanding the signal that initiates this disassociation 
will be of significant interest. 


2. Disulfide Bond Formation 


Numerous vaccinia virion structural proteins contain intra or inter- 
molecular disulfide bonds. These include the core proteins A4, L4, 
and A10 (4a), and the membrane proteins A14, A17, A27, D8, L1, 
A28, A21, L5, and H2 (Ichihashi and Oie, 1996; Ichihashi et al., 1984; 
Locker and Griffiths, 1999; Oie and Ichihashi, 1981; Rodriguez et al., 
1987, 1993; Senkevich and Moss, 2005; Senkevich et al., 2000b, 2004a; 
Townsley et al., 2005a,b; Senkevich, T. G., personal communication; 
Wolffe et al., 1995). Disulfide bond formation in a subset of the mem- 
brane proteins, L1, A28, A21, L5, and H2, is controlled by a virally 
encoded redox system, and disruption of this redox system disrupts 
virus morphogenesis, as described in the following paragraphs. 

The vaccinia coded redox system was uncovered through investiga- 
tion of vaccinia genes which are conserved throughout poxviruses and 
which also contained sequence motifs suggestive of redox capability 
(Senkevich et al., 2000a,b; White et al., 2000). The complete pathway 
consists of the E10, A2.5, and G4 proteins, which interact with each 
other to catalyze a cascade of disulfide oxidation—reduction reactions 
shown in Fig. 14, sequentially oxidizing and reducing intramolecular 
disulfide bonds within each enzyme and ultimately resulting in forma- 
tion of intramolecular disulfide bonds in a number of target proteins 
(Senkevich et al., 2002a). All three redox enzymes are associated with 
the virion; E10 and G4 both seem to be concentrated in the virion 
membrane. The target proteins include L1, F9, A28, A21, L5, and H2 
(Senkevich et al., 2000b, 2004a; Townsley et al., 2005a,b). L1 is de- 
scribed in Section IX.D.2, and A28, A21, L5, and H2, all components of 
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Fic 14. Vaccinia virus disulfide bond formation pathway. The coupled thiol-disulfide 
exchange reactions from E10R to the substrate proteins L1R and F9L are depicted. 
Electron transfers to FAD and oxygen indicated in the dashed rectangle are hypothetical 
(Senkevich et al., 2002a). Reprinted from Senkevich et al. (2002a) with permission. 
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a membrane-anchored complex controlling virus entry, are discussed 
in Section IX.G.3. The F9 protein shows significant homology to L1, 
containing the same six conserved cysteines, although the role of F9 in 
infection is currently unknown (Senkevich et al., 2000b). 

Repression of any of the three enzymes of the redox pathway, E10, 
A2.5, or G4, interrupts virion morphogenesis at the IVN to MV transi- 
tion (Ravanello and Hruby, 1994b; Senkevich et al., 2000a, 2002b; 
White et al., 2000). Specifically, in these mutants, MV are significantly 
decreased in number or absent from infection while IV accumulate. 
This does not imply that the redox pathway is active specifically during 
the IVN to MV transition; on the contrary, the redox substrates L1 and 
F9 are oxidized when morphogenesis is blocked at earlier stages by 
repression of the A17 membrane protein or treatment with rifampicin 
(Senkevich et al., 2000b). However, the fact that redox pathway mu- 
tants and L1 mutants (Section IX.D.2) produce the same phenotype 
suggests that the phenotype of the redox pathway mutants is, at 
least in part, a manifestation of their inability to oxidize the L1 disul- 
fide bonds, thereby presumably rendering L1 nonfunctional (Blouch 
et al., 2005). 


3. Addition of New Surface Proteins 


In 1962, in a brief, prescient, and elegant report, Morgan and cow- 
orkers showed that when ferritin-labeled antibodies prepared from 
rabbits immunized with vaccinia virus were used in immunoelectron 
microscopic examination of vaccinia-infected cells, MV were stained 
but IV were not stained. The authors concluded “that maturation of 
vaccinia virus into the stable, infectious form is accompanied not only 
by an alteration in structure but also by the acquisition of a surface 
coating of antigen not present on the immature particles” (Morgan 
et al., 1962). Forty-four years later, we understand with some precision 
the nature of a few of these modifications. Specifically, immunoelectron 
microscopy of infected cells demonstrates that antibody against either 
of the surface proteins A27 (Section X) or H3 shows significantly more 
staining of MV than IV, suggesting that both proteins are added to 
particles during the IV to MV transition (da Fonseca et al., 2000a; 
Sodeik et al., 1995). As described previously (Section IX.D.3), H3 is a 
transmembrane protein that undergoes posttranslational insertion 
into virion membranes. In contrast, the peripheral protein A27 is 
anchored to the viral membrane via its interactions with the A17 
transmembrane protein. A27 is not required for MV formation but 
rather for wrapping of MV with Golgi membranes to form WV. Thus, 
surface proteins can be incorporated during the later stages of virion 
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maturation by distinct mechanisms. We speculate that these additions 
occur after the departure of the D13 lattice from the IV membrane. 
Other candidates for surface proteins added during the IV to MV 
transition include the L1 protein (Section IX.D.2). Immunoelectron 
microscopy of virus infected cells using anti-L1 antibody shows a high- 
er density of L1 staining in regions containing MV than regions con- 
taining IV (Wolffe et al., 1995). As described previously, L1 is required 
for the IV to MV transition, a process that is also stimulated by H3. It 
seems likely that additional surface modifications during the IV to MV 
transition will be revealed in the future. 

Dales and coworkers have suggested that the spicule layer surround- 
ing IV, now known to be composed of D13 protein, is replaced by the 
STEs appearing on MV, raising the question of the relationship between 
STE and the proteins added late during MV maturation (Essani et al., 
1982; Stern and Dales, 1976). Wilton et al. (1995) examined the protein 
composition of isolated STE, and while they did not unambiguously 
identify each of the STE proteins, the results suggest that STE contain 
a mixture of proteins that appear in the crescents and MV membrane 
both early and late during morphogenesis, such as A17 and H3. Thus 
while the removal of D13 from IV and addition of several immunodomi- 
nant MV membrane antigens to the developing MV may restructure the 
particle surface, it is likely that STE represent a complex of several of 
the full complement of MV membrane proteins, rather than exclusively 
proteins added after the removal of D183. 


F. Core Proteins that are not Essential for Virion Morphogenesis 
but are Required for Transcriptional Competence of the Virion Core 


Mutation of the genes encoding certain core proteins results, under 
nonpermissive conditions, in the formation of MV that are normal in 
appearance but not infectious. Two of these proteins, the I8 RNA heli- 
case and the H6 topoisomerase, are virion enzymes. The phenotypes of 
I8 and H6 mutants are similar: permeabilized virions purified from 
nonpermissive infections display defects in early transcription. These 
data imply that the RNA helicase and DNA topoisomerase are both 
required to deal with the topological constraints associated with the 
virion transcription reaction (da Fonseca and Moss, 2003; Gross and 
Shuman, 1996). There also exist at least four virion proteins that are 
not thought to function in transcription per se but rather seem to affect 
the assembly of a core that is competent for in situ transcription. These 
include L4, L3, H1, and E8, described in more detail in the following 
sections. 
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1. L4 


The L4R gene encodes a 29-kDa protein that is proteolytically 
processed to the mature 25-kDa form during the IV to MV transition 
(Fig. 13). L4 was originally referred to as VP8 and has been shown to 
have both DNA and RNA binding activity (Bayliss and Smith, 1997; 
Yang and Bauer, 1988; Yang et al., 1988). An inducible recombinant has 
been generated in which L4 expression is IPTG-dependent (Wilcock and 
Smith, 1994, 1996). When L4 is repressed, formation of both IV and MV 
is observed by electron microscopy. The IV, however, appear slightly 
abnormal: they possess a space between the viroplasm and the inner 
surface of the membrane, as if bonding of the viroplasm to the mem- 
brane is weaker than normal. L4-deficient MV contain an otherwise 
normal complement of virion proteins, but they are defective in tran- 
scription as judged by both in vivo and in vitro assays, and hence they 
are not infectious. Furthermore, when these virions are extracted with 
deoxycholate to solubilize virion enzymes, more proteins appear to be 
more easily released than when L4-containing virions are examined in 
parallel. Thus, the internal structure of MV assembled in the absence of 
L4 appears to be somewhat fragile. These findings suggest that L4 plays 
a structural role in the assembly of a transcriptionally-competent core; 
in this regard, it is worth noting that L4 was originally reported to be 
encapsidated at ~7000 copies/virion. 


2. L3 


The L3L gene encodes a highly conserved 41-kDa protein that is 
packaged into virion cores. Repression of L3 expression results in the 
assembly of MV that are ultrastructurally normal in appearance but 
are reduced in infectivity (Resch and Moss, 2005). L3-deficient virions 
contain the normal complement of active transcription enzymes, but 
they are nevertheless defective for transcription in situ. Consistent 
with these observations, the L3-deficient MV can bind and enter cells, 
however, only very small amounts of early viral mRNA are synthe- 
sized. Thus, similar to L4 (above) and E8 (below), L3 deficiency may 
produce a subtle defect in MV structure that ultimately impacts on the 
transcriptional competency of the mutant virions. 


3. H1 


The HIL gene encodes a dual specificity protein phosphatase. This 
20-kDa protein is encapsidated at ~200 copies/virion (Guan et al., 
1991; Liu et al., 1995). It is known to regulate the phosphoryla- 
tion status of a number of viral and cellular proteins in vivo and has 
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been shown to dephosphorylate A17, A14, and F17 in vitro. An in- 
ducible recombinant has been generated in which H1 expression is 
TET-dependent. In the absence of H1, F17, A14, and A17 are hyper- 
phosphorylated. Moreover, the virions purified from H1-deficient in- 
fections show a phenotype much like that engendered by repression 
of L4: the cores show a reduced stability and are transcriptionally 
incompetent. Hence, H1 is not required for morphogenesis per se but 
rather affects the assembly of a transcriptionally competent core and 
hence is required for virion infectivity. 


4. E8 


E8 initially drew attention during a search for proteins which might 
associate with the ER membranes surrounding virus factories early 
during infection (Doglio et al., 2002; Schramm and Locker, 2005; 
Tolonen et al., 2001). The E8 amino acid sequence predicts the presence 
of two possible transmembrane domains, and both immunoelectron mi- 
croscopy and immunofluroescence suggests an association with viral and 
cellular membranes during infection. These observations led Locker and 
coworkers to propose a role for E8 in mediating the assembly of ER- 
delimited cytoplasmic domains within which viral replication would 
occur (Schramm and Locker, 2005) (Fig. 5) (Section V). However, other 
observations seem inconsistent with this model. First, the E8 protein 
clearly partitions to the virion core on extraction with NP40 and DTT. 
Second, while Locker and coworkers reported that E8 was expressed 
early during infection, more recent experiments demonstrate that E8R 
is a late gene, making it unlikely that the protein plays a role early during 
infection (Kato, Condit, and Moussatche, unpublished; Nichols and 
Traktman, unpublished). Lastly, infections with a temperature-sensitive 
mutant containing a lesion in the E8R gene result in the assembly of 
morphologically normal MV, WV, and EV; the MV that form show a defect 
in transcription in situ and a significant reduction in infectivity (Kato, 
Condit, and Moussatche, unpublished). Thus, in many respects E8 be- 
haves like a structural protein that contributes to the assembly of a 
transcriptionally active core, similar to L4, L3, or H1. 


G. Membrane Proteins Affecting Virus Binding, Entry, and Fusion 


Several additional MV membrane proteins have been identified 
which, when mutated, result in formation of MV that are morphologi- 
cally normal in appearance. Ten of these proteins have roles in virus 
entry and/or fusion as detailed in the following section. 
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1. Entry/Fusion Complex 


A complex of eight interacting proteins has been described, which is 
anchored in the virion membrane and whose components appear to 
mediate fusion of the virion membrane with the plasma membrane 
during virus entry (Ojeda et al., 2006; Senkevich and Moss, 2005; 
Senkevich et al., 2004b, 2005; Townsley et al., 2005a,b). The complex 
contains the proteins A21, L5, A28, H2, G3, G9, A16, and J5. At least 
four of the proteins contain intramolecular disulfide bonds catalyzed 
by the virus coded redox system (Section IX.E.2), and three are myr- 
istylated. Inducible mutants have been constructed for five of the 
genes encoding fusion complex proteins, and in all cases the mutants 
produce MV that are morphologically normal and transcriptionally 
active and which can undergo wrapping and export from cells as EV. 
These mutant MV can bind to cells but are defective in penetration. 
Repression of individual components of the complex does not appear 
to block the insertion of the remaining proteins into the virion mem- 
brane, although it does disrupt their ability to mediate virion entry. 
In sum, the membrane fusion complex is dispensable for virion 
morphogenesis but essential for virion entry. 


2. [2 


The J2L gene encodes a 78-amino acid protein with a hydrophobic 
C-terminus predicted to serve as a transmembrane anchor. Indeed 
(Nichols, Stanitsa, Traktman, submitted for publication), I2L is ex- 
pressed late, behaves as an integral membrane protein, and is encap- 
sidated into the virion membrane. I2 appears to be an essential 
protein, and an inducible recombinant in which I2 expression is TET 
dependent has been generated. In the absence of I2, the viral life cycle 
progresses normally, but the MV that are produced show an ~400-fold 
reduction in specific infectivity. These virions are transcriptionally 
competent and can bind to target cells but are defective in fusion- 
mediated entry. Interactions between I2 and the entry complex 
described previously have not been investigated. 


3. D8 


The D8 protein is expressed late and incorporated into the virion 
membrane; the transmembrane domain is found at the N-terminus of 
the protein, and the C-terminus is exposed on the surface of the virions 
(Niles and Seto, 1988). D8 is one of three proteins, including A27 and 
H3, which bind cell surface glycosaminoglycans and are therefore 
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implicated in initial events that facilitate the binding of virus to cells 
(Chung et al., 1998; Hsiao et al., 1998, 1999; Lin et al., 2000). A soluble 
form of D8 has been shown to bind to chondroitin sulfate on the cell 
surface and to compete with virions for binding to the cell (Hsiao et al., 
1999). Viruses in which D8 has been insertionally inactivated are 
viable and direct normal morphogenesis; however, the virions that 
are produced show a 10-fold reduction in specific infectivity due to an 
impaired ability to bind to target cells. 


X. TRANSPORT, OCCLUSION, AND SECONDARY WRAPPING OF MV 


As noted previously (Section III), most MV localize in clusters within 
the cytoplasm that are distal from the virus factories in which IV and 
IVN are assembled. Treatment of infected cells with drugs that in- 
hibit microtubule formation prevents this characteristic trafficking, 
implying that MV dissemination is a microtubule-mediated process 
(Sanderson et al., 2000). The mechanism by which MV interacts 
with microtubules has been investigated but remains unresolved 
(Sanderson et al., 2000; Ward, 2005). 

While the majority of MV remain intracellular and represent a 
functional end stage to virus assembly, a fraction of MV may undergo 
further processing to be either exported as EV or embedded within 
A-type inclusions (ATIs). Two MV membrane proteins are implicated 
in these processes, A27 and A26. 


A. A27 


A27 is unusual among MV membrane proteins in that it does not 
contain a transmembrane domain. Instead, the A27 protein forms a 
disulfide-bonded trimer, which is anchored to the MV membrane via a 
strong interaction with the integral A17 membrane protein (Lai et al., 
1990; Rodriguez et al., 1987, 1993). A27 is an immunodominant anti- 
gen, and anti-A27 antibodies possess virus neutralizing activity 
(Rodriguez et al., 1985). As described previously, A27 is added to virus 
particles late during maturation; it appears on MV but not IV (Section 
IX.E.3). When A27 is repressed, morphologically normal MV form that 
are fully infectious, although they are unable to undergo secondary 
wrapping by Golgi-derived membranes to form WV and, ultimately, to 
be released by exocytosis as EV (Rodriguez and Smith, 1990b; Ward, 
2005). The A27 protein has also been implicated in attachment of virus 
to cells and in intracellular transport of MV (Chung et al., 1998; 
Sanderson et al., 2000). 
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B. A26 


ATIs are large, cytoplasmic, electron-dense regions within which 
numerous MV are embedded (McKelvey et al., 2002). Formation of 
ATIs requires two viral genes, the ATI gene and the A26L gene. The 
ATI gene encodes the protein that forms the inclusion; the ATT gene is 
not essential, and indeed the WR strain of vaccinia does not express an 
intact ATI protein and hence does not form inclusions. The A26L gene, 
on the other hand, encodes an MV membrane protein, initially named 
p4c, which controls whether or not MV become embedded in inclu- 
sions. A26 is nonessential: if the p4c gene is inactivated, infectious MV, 
WV, and EV are produced. However, any inclusions that are formed do 
not contain embedded virus (McKelvey et al., 2002). The A26 protein is 
present on the surface of MV but not EEV, and it is therefore possible 
that the presence or absence of p4c in the virion membrane may 
influence whether MV undergo secondary wrapping to become WV 
and EV, or whether they remain unwrapped in the cytoplasm (Ulaeto 
et al., 1996). It is not yet known how A26 associates with the MV 
membrane; although some computer algorithms have predicted the 
presence of a transmembrane region, others do not, and the protein 
does not contain any regions of significant hydrophobicity. However, 
the A26 protein does contain a C-terminal region that is 44% identical 
to the region of the A27 protein that interacts with the membrane- 
bound A17 protein (see earlier). This observation raises the interesting 
possibility that A26 and A27 may compete for binding to A17 and thus 
determine the balance between MV that remain unwrapped and those 
that continue on to form WV and EV. 


XI. VIRION Proteins or UNKNOWN FUNCTION 


Four additional virion proteins of unknown function have been dis- 
cussed in the literature: the E11 and A12 core proteins and the I5 and 
A14.5 membrane proteins (Fig. 12). Little is known about E11 except 
that temperature-sensitive mutants with lesions in this gene produce 
noninfectious virions at the nonpermissive temperature (Wang and 
Shuman, 1996). An inducible recombinant in which expression of 
A14.5 is IPTG-dependent has been generated; repression of this protein 
has no impact on the viral life cycle in culture but leads to an attenuated 
phenotype in mice (Betakova et al., 2000). I5 (VP13) and A12 were 
identified during proteomic analyses of the vaccinia virion (Chung 
et al., 2006; Takahashi et al., 1994). No function has been ascribed to 
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either protein as neither gene has been subjected to genetic analysis, 
although both genes are highly conserved among poxviral genomes and 
as such the proteins are likely to play significant roles(s) during some 
facet of the viral life cycle. Encapsidation of I5 into the virion membrane 
has been verified by analysis of a virus encoding an epitope-tagged 
I5 allele at the endogenous locus (Nichols, Stanitsa, and Traktman, 
unpublished). The A12 core protein undergoes proteolytic maturation 
during virion morphogenesis (Whitehead and Hruby, 1994b). 


XII. SUMMARY AND PERSPECTIVES 


Investigation into poxvirus structure and assembly is now at a tech- 
nical and conceptual crossroads. Identification of proteins in the virion 
and basic characterization of mutants affecting assembly is approach- 
ing saturation. A detailed picture of the contribution of individual genes 
to the structure and stepwise assembly of the virion is therefore begin- 
ning to emerge. The genetic, biochemical and cell biological tools are 
in place to bring this task to completion. Fascinating and significant 
questions remain to be answered, including: 


A. What are the precise localizations within the virion of each of the 
encapsidated proteins? What is the composition and structure of the 
tube-like structure within the core? What proteins make up the indi- 
vidual layers of the core wall? What proteins are contained in 
the lateral bodies and what are their functions? What are the struc- 
tural relationships among the proteins in the membrane and how does 
this contribute to the bilaminar appearance of this structure? 

B. What is the initial source of the membrane used to form crescents, 
how is the single bilayer of the crescents and IV assembled and stabi- 
lized in the virus factories, and what are the contributions of cellular 
components and individual viral proteins to this process? 

C. How are viral factories organized so that assembly components 
are delivered to maturing virions in an organized fashion? 

D. What are the details of the molecular interactions that bind cres- 
cents to viroplasm, and how does this relate to the ultimate maturation of 
the virion core? 

E. What is the precise mechanism of DNA encapsidation into 
virions? 

F. What triggers the transition from IVN to MV, and how are the 
various events in this transition co-coordinated, including loss of D138, 
restructuring of the virion surface, proteolysis of structural proteins, 
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maturation of the core, and migration of maturing virions from 
factories to the periphery of the cell? 


A thorough understanding of poxvirus morphogenesis will provide 
invaluable insights into this complex and biomedically significant virus. 
Moreover, these studies should lead to the elucidation and discovery of 
cell biological strategies that have deep relevance to such diverse pro- 
cesses as organelle biogenesis, intracellular protein trafficking, and 
membrane remodeling. 
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ABSTRACT 


Carcinoma of the uterine cervix, a leading cause of cancer death in 
women worldwide, is initiated by infection with high-risk types of human 
papillomaviruses (HPVs). This review summarizes laboratory studies 
over the past 20 years that have elucidated the major features of the 
HPV life cycle, identified the functions of the viral proteins, and clarified 
the consequences of HPV infection for their host cells. This information 
has allowed the development of various strategies to prevent or treat 
infections, including prophylactic vaccination with virus-like particles, 
therapeutic vaccination against viral proteins expressed in cancer cells, 
and antiviral approaches to inhibit virus replication, spread, or patho- 
genesis. These strategies have the potential to cause a dramatic reduc- 
tion in the incidence of cervical carcinoma and serve as the prototype for 
comprehensive efforts to combat virus-induced tumors. 


I. INTRODUCTION 


The study of tumor viruses has provided many insights into funda- 
mental cellular processes such as cell cycle control, signal transduction, 
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proliferation, apoptosis, and senescence. Increased understanding of 
human tumor viruses has also suggested new approaches to prevent 
or treat cancer. Approximately, 15% of all cancer deaths worldwide are 
caused by cancers initiated by human tumor viruses, with a particularly 
high burden in the developing world (Parkin et al., 2001). However, only 
a small fraction of people infected with tumor viruses develop cancer, 
and cancer typically arises many years after the initial infection. The 
recognition that a cancer has a viral etiology and the identification of 
the responsible virus suggest various strategies to inhibit cancer forma- 
tion, including vaccination to prevent infection. The persistence of viral 
genes and gene products in cancer cells raises the possibility that anti- 
viral approaches will be useful in preventing and treating cancer, even 
in its late stages. This review illustrates a number of these principles 
with regard to one of the best understood human tumor viruses, the 
human papillomaviruses (HPVs), and carcinoma of the uterine cervix. 


II. Lire CYCLE OF THE PAPILLOMAVIRUSES 


Papillomaviruses are small, nonenveloped DNA viruses that have 
been isolated from numerous vertebrate species, including humans. 
The viral genome is a double-stranded circular DNA molecule approx- 
imately 8000 base pairs in length that encodes no more than ten pro- 
teins. There are two clusters of viral genes, the early and the late 
genes, both of which are transcribed from the same DNA strand (Engel 
et al., 1983). The early region encodes genes primarily involved in viral 
DNA replication and cell transformation, and the late genes encode the 
structural proteins of the virus particle. A segment of DNA upstream 
of the early genes is devoid of protein coding sequences and contains 
the origin of viral DNA replication and transcriptional regulatory 
elements (Chong et al., 1991). Transcriptional control is complex, in- 
volving multiple promoters and enhancers, two major polyadenylation 
sites, and an array of alternatively spliced mRNAs (Bernard, 2002). 
Proper transcription of the viral genome depends on the differentiation 
state of the host epithelial cell and on cellular transcription factors 
and polymerases, as well as the splicing and other RNA processing 
machinery (Bernard, 2002; Longworth and Laimins, 2004). 

Studies of the HPV life cycle have been hindered by the inability of 
papillomaviruses to undergo high-level replication in the laboratory. 
Nevertheless, certain aspects of the virus life cycle can be studied in 
animals and in various cultured cell systems, including organotypic 
epithelial cell systems that mimic the tissue architecture of the normal 
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host cells and support limited viral replication (Asselineau and 
Prunieras, 1984; Brandsma and Xiao, 1993; Brandsma et al., 1991; 
Dollard et al., 1992; Frattini et al., 1996; Meyers et al., 1992, 1997). 
Productive papillomavirus replication in natural settings takes place 
exclusively in stratified squamous epithelia, such as the epidermis or 
mucous membranes, tissues that are replenished by periodic replica- 
tion of basal cells apposed to the basement membrane. HPV infection is 
thought to be initiated by inoculation of the virus into basal epithelial 
cells through wounds or abrasions, and cell binding and entry is 
mediated by a poorly characterized cellular receptor that is probably 
a heparin sulfate proteoglycan (Giroglou et al., 2001; Joyce et al., 1999; 
Shafti-Keramat et al., 2003). In uninfected tissue, as daughter cells 
migrate away from the basement membrane toward the surface of the 
epithelium, they cease proliferation and undergo a process of terminal 
differentiation until they eventually die and slough off from the sur- 
face of the tissue. Productive papillomavirus infection disrupts normal 
epithelial differentiation (McCance et al., 1988) and causes epithelial 
hyperplasia and the formation of benign epithelial tumors known as 
warts (or papillomas or condylomas, depending on the anatomic site of 
the lesion). 

Like other small DNA viruses, HPVs utilize the host-cell DNA repli- 
cation machinery to establish an environment able to support viral 
DNA replication and to carry out DNA synthesis. However, once 
epithelial cells exit the basal layer, they also normally exit the cell 
cycle. Therefore, the virus has to maintain the suprabasal cells in a 
state competent for DNA replication. This is primarily the task of 
the HPV E7 protein (Cheng et al., 1995; Foster and Galloway, 1996). 
The E7 protein binds to and destabilizes the cellular retinoblastoma 
(Rb) tumor suppressor protein and members of the Rb family, thereby 
displacing E2F transcription factors, which then stimulate expression 
of genes required for cellular (and viral) DNA synthesis and cell 
cycle progression (Boyer et al., 1996; Chellappan et al., 1992; Dyson 
et al., 1989; Gonzalez et al., 2001; Munger et al., 1989b). The E7 protein 
also inhibits the action of cyclin-dependent kinase inhibitors and inter- 
acts with cyclin—cdk complexes and other cellular proteins to stimulate 
cell cycle progression (Funk et al., 1997; Jones et al., 1997; McIntyre 
et al., 1996; Noya et al., 2001; Tommasino et al., 1993; Zerfass-Thome 
et al., 1996). In addition, the HPV E6 protein perturbs cell cycle control 
by binding to the p53 tumor suppressor and targeting it for accelerated 
ubiquitin-mediated degradation (Scheffner et al., 1990, 1993; Werness 
et al., 1990). In combination, these E6 and E7 activities generate 
host cells that can support viral DNA replication. Thus, both E6 


128 DANIEL DIMAIO AND JOHN B. LIAO 


and E7 are essential for the productive viral life cycle (Brandsma et al., 
1991; Flores et al., 2000; McLaughlin-Drubin et al., 2005; Wu 
et al., 1994). 

Viral DNA replication is primarily mediated by the papillomavirus E1 
and E2 proteins working in concert with cellular DNA polymerases and 
auxiliary replication proteins (Conger et al., 1999; Loo and Melendy, 
2004; Masterson et al., 1998; Park et al., 1994; Piirsoo et al., 1996; Ustav 
and Stenlund, 1991). The E1 and E2 proteins both bind specifically to the 
origin of viral DNA replication (ori) upstream of the E6 gene (Androphy 
et al., 1987; Holt et al., 1994; Spalholz et al., 1987; Ustav et al., 1993; 
Wilson and Ludes-Meyers, 1991). The E1 protein, an ATP-dependent 
helicase, is the only viral coded enzyme (Sedman and Stenlund, 1998; 
Seo et al., 1993; Yang et al., 1993). It plays essential roles in papillomavi- 
rus DNA replication by melting ori as well as unwinding the parental 
DNA strands in advance of the replication fork (Chen and Stenlund, 
2002; Gillette et al., 1994; Sanders and Stenlund, 1998; Sedman and 
Stenlund, 1998). The participation of the E1 protein in viral DNA repli- 
cation is assisted by the E2 protein, which forms a complex with the 
E1 protein and enables its high-affinity, high-specificity binding to ori 
(Frattini and Laimins, 1994; Mohr et al., 1990; Sedman and Stenlund, 
1995; Yang et al., 1991). Proteins and small molecules have been 
described that inhibit E1 helicase activity, impair complex formation 
between the E1 and E2 proteins, or block E1 or E2 DNA binding (Deng 
et al., 2004; Faucher et al., 2004; Hartley and Alexander, 2002; Kasukawa 
et al., 1998; White et al., 2003). In some cases, these agents interfere with 
HPV DNA replication in cultured cells and may form the basis of future 
antiviral approaches. 

The viral genome replicates at a low level in the nuclei of the infected 
cells in the basal layer, but no progeny virus is produced in these cells. 
In order for productive infection to take place, the viral DNA must 
replicate to a high-copy number in differentiating cells that have 
migrated away from the basal layer (Bedell et al., 1991; Longworth 
and Laimins, 2004). This high-level DNA replication may involve a 
switch to a rolling circle mode of viral DNA replication (Flores and 
Lambert, 1997), but the biochemical mechanism responsible for this 
switch is unknown. 

The E2 protein also plays an important role in segregation of viral 
DNA plasmids as cells divide (Ilves et al., 1999; Lehman and Botchan, 
1998). By simultaneously binding to viral genomes and cellular 
chromatin-associated proteins, such as Brd4, during mitosis, the E2 
protein can “piggy-back” on cellular chromosomes to ensure proper 
segregation of the viral DNA into daughter cells upon cell division 
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(Bastien and McBride, 2000; Baxter et al., 2005; Skiadopoulos and 
McBride, 1998; Van Tine et al., 2004a; You et al., 2004). As discussed in 
a later section, the E2 protein is also involved in transcriptional control 
(Hawley-Nelson et al., 1988; Spalholz et al., 1985). Genetic studies indi- 
cate that the E6 and E7 proteins also play a role in stable maintenance of 
HPV plasmid DNA (Park and Androphy, 2002; Thomas et al., 1999). 

HPV late mRNAs and capsid proteins are produced only in highly 
differentiated cells in the superficial layers of the epithelium. In these 
cells, mRNAs that encode abundant quantities of the L1 major capsid 
protein, as well as the L2 minor capsid protein, accumulate through a 
variety of mechanisms including activation of promoters, bypass of 
the early polyadenylation signal, and use of novel mRNA splicing pat- 
terns (Barksdale and Baker, 1993, 1995; Collier et al., 2002; Cumming 
et al., 2002; Kennedy et al., 1991; Klumpp and Laimins, 1999; Oberg 
et al., 2003; Ozbun and Meyers, 1998; Terhune et al., 1999; Zhao et al., 
2004). The L1 and L2 proteins are transported into the nucleus where 
they assemble into virus particles and package replicated viral DNA. 

Two other papillomavirus proteins play roles in the virus life cycle. 
The E5 protein is a very short, transmembrane protein encoded at the 
3’ end of the early region of many papillomavirus types. The E5 protein 
from bovine papillomavirus type 1 causes tumorigenic transformation 
of cultured fibroblasts by inducing ligand-independent activation of 
the platelet-derived growth factor @ receptor (DiMaio et al., 2000; 
Petti et al., 1991), but the role of the E5 protein in the normal life cycle 
of bovine papillomavirus is not known. The E5 proteins of the HPVs are 
also required for optimal productive infection (Fehrmann et al., 2003; 
Genther et al., 2003). This requirement may involve interaction of the 
E5 protein with the epidermal growth factor receptor or the vacuolar 
H -ATPase (Conrad et al., 1993; Genther Williams et al., 2005; Hwang 
et al., 1995; Schapiro et al., 2000; Straight et al., 1993, 1995; Zhang et al., 
2005). The E4 protein is encoded by a gene in the early region that 
entirely overlaps the E2 gene in an alternative translational reading 
frame. Nevertheless, the E4 protein is abundantly expressed during the 
late stages of the HPV life cycle and may facilitate release of mature 
virus from cells (Brown et al., 2005; Doorbar et al., 1991; Nakahara 
et al., 2005; Peh et al., 2004; Wilson et al., 2005). 


III. EVIDENCE FOR AN ESSENTIAL ROLE oF HPV IN CERVICAL CARCINOMA 


Papillomaviruses have long been known to cause benign epithelial 
warts, but the role of HPVs in malignant disease has been elucidated 
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only within the last 25 years. The association with HPV infection is 
strongest for cervical carcinoma, a leading cause of cancer death in 
women in the developing world. HPV infection is also strongly asso- 
ciated with other anogenital cancers, approximately 20% of oropha- 
ryngeal cancers, and some nonmelanoma skin cancers (Gillison, 2004; 
Parkin et al., 2001). The first hint that cervical cancer had an infec- 
tious component was the observation in 1842 that uterine (presumably 
cervical) cancer developed in married women in Florence, Italy, but 
was rare in celibate nuns who lived in convents outside of the city 
(Scotto and Bailar III, 1969). In retrospect, this pattern reflected the 
existence of a sexually transmitted agent. Over the next 140 years, the 
notion that an infectious agent initiated cervical carcinogenesis gained 
credence, but the identity of the agent and the carcinogenic mecha- 
nism remained unknown. For some time, herpes simplex virus type 
2 was proposed as a causative agent for cervical carcinoma, but this 
hypothesis has now been discarded (Vonka et al., 1984, 1987). The 
molecular cloning and sequencing of HPV genomes from various types 
of papillomas in the early 1980s revealed a remarkable multiplicity of 
related but distinct HPV genotypes. Seminal studies by Harald zur 
Hausen and others identified several HPV DNA types in genital warts, 
cervical cancer biopsies, and cervical cancer cell lines (Boshart et al., 
1984; de Villiers et al., 1981; Durst et al., 1983; Gissmann et al., 1983; 
McCance et al., 1985; zur Hausen et al., 1981). The HPV DNA types 
that are preferentially present in cervical carcinomas are referred to as 
high-risk HPV types and include HPV16, which is present in approxi- 
mately 50% of all cervical cancers worldwide, and HPV18, found in 
another approximately 15%. Later comprehensive PCR-based studies 
revealed the presence of high-risk HPV DNA in virtually all cervical 
cancers worldwide (Bosch et al., 2002; Walboomers et al., 1999). The 
same high-risk DNA types are also present in HPV-associated cancers 
of other mucosal sites (Gillison, 2004). 

There are numerous lines of evidence that the high-risk HPV types 
play an essential etiologic role in cervical cancer. As noted earlier, 
virtually all cervical cancers contain high-risk HPV DNA, whereas 
other genital HPV types (the low-risk HPV) typically are absent 
(Walboomers et al., 1999). Second, extensive epidemiological studies 
revealed that persistent infection with a high-risk HPV type confers a 
dramatically elevated risk of cervical neoplasia (Bosch et al., 2002; 
Hildesheim et al., 1994; Ho et al., 1998; Koutsky et al., 1992; Schlecht 
et al., 2001; Wallin et al., 1999). Although the high-risk HPV types 
display worldwide distribution, some HPV16 and HP18 subtypes 
appear to display increased oncogenic potential compared to the 
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prototype strains (Berumen et al., 2001; Villa et al., 2000; Xi et al., 
1997). Third, high-risk HPVs express oncogene products that interact 
with known cellular growth-regulatory components, including Rb and 
p53, and induce measures of oncogenic transformation in cultured 
human cells (Bedell et al., 1989; Durst et al., 1987; Dyson et al., 1989; 
Halbert et al., 1991; Hawley-Nelson et al., 1989; Hudson et al., 1990; 
Munger et al., 1989a; Pirisi et al., 1987; Werness et al., 1990). Inacti- 
vation of p53 and Rb pathways commonly occurs in sporadically arising 
tumors as well. Fourth, these viral oncogene products are continuously 
expressed in cervical cancer cells, even many years after the cancer 
develops, and are required for the proliferation and survival of cervical 
cancer cell lines (Francis et al., 2000; Hwang et al., 1993; Schwarz et al., 
1985; von Knebel Doeberitz et al., 1988; Yee et al., 1985). Fifth, trans- 
genic mice expressing high-risk HPV oncogenes develop carcinomas, 
including cervical cancer in mice treated with estrogen (Arbeit et al., 
1994, 1996; Brake and Lambert, 2005; Lambert et al., 1993). Finally, 
vaccination that prevents persistent high-risk HPV infection prevents 
the development of precancerous cervical changes in women (Harper 
et al., 2004; Koutsky et al., 2002; Villa et al., 2005). In aggregate, these 
results provide compelling evidence that cervical carcinogenesis is 
initiated by high-risk HPV infection. 


IV. PATHOGENESIS OF CERVICAL CARCINOMA 


Most cervical carcinomas develop in the transformation zone of the 
cervix, where the columnar epithelium of the endocervix merges with 
the stratified squamous epithelium of the exocervix. High-risk HPV 
infection of this tissue causes flat cervical warts, which are usually 
asymptomatic and often regress spontaneously (Hildesheim et al., 
1994). However, a fraction of these infections generate a series of 
increasingly dysplastic cell populations (denoted as cervical intrae- 
pithelial neoplasia [CIN]) that can culminate in carcinoma-in situ 
(Koutsky et al., 1992). Cervical carcinoma-in situ can be readily trea- 
ted locally, but untreated lesions can progress to invasive cervical 
cancer for which there is limited treatment success. Invasive disease 
occurs in only a small minority of women infected with high-risk HPV, 
and the time between initial infection and cancer is typically >10 years 
(Goldie et al., 2003). This protracted natural history combined with the 
accessibility of the cervix for cell sampling is responsible for the suc- 
cess of Pap smear screening programs, which identify women with 
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HPV-induced precancerous changes at relatively early, treatable 
stages of their disease. These screening programs caused a dramatic 
decline in the incidence of cervical cancer in the developed world and 
represent a triumph of public health intervention (Dorn and Cutler, 
1959; Gustafsson et al., 1997; Johannesson et al., 1978). The absence of 
robust screening programs in resource-poor areas is in large part 
responsible for the high incidence of cervical cancer in much of the 
developing world, although other factors, such as host genotype or 
high frequency of HPV variants with increased oncogenicity, may also 
play a role (Berumen et al., 2001). The addition of HPV DNA typing to 
other methods of evaluation including standard cytologic analysis of 
cervical cells will presumably improve the diagnosis and management 
of cervical disease (Goldie et al., 2005; Schiffman et al., 2005b). 
During carcinogenic progression of cervical lesions, HPV DNA often 
integrates into cellular chromosomes, an event which can disrupt 
the E1 and E2 genes, preventing vegetative viral DNA replication 
and stimulating cell growth (Cullen et al., 1991; Hopman et al., 2004; 
Jeon et al., 1995; Schneider-Maunoury et al., 1987). Although the E2 
protein is an essential replication factor, it can also function as a 
transcriptional repressor of the viral major early promoter (Bernard 
et al., 1989; Bouvard et al., 1994; Dostatni et al., 1991; Steger and 
Corbach, 1997; Thierry and Yaniv, 1987). Accordingly, loss of E2 func- 
tion as a consequence of viral DNA integration relieves repression of 
the E6 and E7 oncogenes, and confers a growth advantage on these 
cells. The biological relevance of this mechanism is supported by 
the finding that E2 proteins from HPV16 strains with increased onco- 
genicity display reduced repressor activity (Berumen et al., 2001; 
Ordonez et al., 2004). In addition, in some situations the E2 protein 
can induce apoptosis, providing additional selective pressure for loss of 
E2 during malignant progression (Demeret et al., 2003; Desaintes 
et al., 1997; Sanchez-Perez et al., 1997; Webster et al., 2000). The E2 
protein also binds to a number of cellular proteins (e.g., Massimi et al., 
1999), so loss of E2 could have pleiotropic effects on cell function. 
Although there are frequently multiple sites of tandemly integrated 
HPV DNA in cervical cancer cells, typically only one copy of the viral 
genome is transcriptionally active (Van Tine et al., 2004b). This active 
copy expresses E6/E7 coding sequences fused to downstream cellular 
sequences, which may stabilize these mRNAs and further elevate E6/ 
E7 levels (Jeon et al., 1995). In the absence of integration, E6/E7 
expression can be increased by other mechanisms, such as mutations 
in the major viral promoter that interfere with binding of the E2 
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protein or cellular transcriptional repressors such as YY1 (Dong et al., 
1994; May et al., 1994; Rose et al., 1998). 

The E5 protein, the other oncogene product encoded by high-risk 
HPV, is expressed in some but not all cervical cancers (Chang et al., 
2001; Hsieh et al., 2000). Phylogenetic comparison of the E5 genes from 
various genital HPV types revealed a correlation between the E5 
sequence and carcinogenic potential, suggesting that the E5 protein 
plays a role in carcinogenesis (Bravo and Alonso, 2004; Schiffman 
et al., 2005a). 

The stepwise progression of natural lesions is mirrored by the be- 
havior of cells transfected with high-risk HPV DNA. Primary human 
cervical and foreskin keratinocytes, which normally undergo a limited 
number of cell divisions in culture, can be efficiently immortalized by 
coexpression of the E6 and E7 genes from the high-risk HPV types 
(Bedell et al., 1989; Durst et al., 1987; Halbert et al., 1991; Hawley- 
Nelson et al., 1989; Hudson et al., 1990; Kaur and McDougall, 1989; 
Munger et al., 1989a; Pirisi et al., 1987; Woodworth et al., 1988). In 
addition, the E6 protein stimulates the expression of telomerase, 
an RNA-dependent DNA polymerase that maintains the ends of chro- 
mosomes in proliferating somatic cells (Gewin and Galloway, 2001; 
Klingelhutz et al., 1996; Oh et al., 2001; Veldman et al., 2001). The 
ability of E6 and E7 proteins to interfere with p53 and Rb function and 
to influence telomerase activity is presumably the basis for their im- 
mortalizing activity (Kiyono et al., 1998), because these same path- 
ways are often affected during non-HPV immortalization (Vaziri and 
Benchimol, 1999). The E6 and E7 proteins also bind to other cellular 
targets, which may contribute to their carcinogenic effects (Antinore 
et al., 1996; Bernat et al., 2003; Bischof et al., 2005; Brehm et al., 1999; 
Luscher-Firzlaff et al., 1999). Notably, the high-risk E6 proteins bind 
to a number of PDZ domain-containing proteins, including some with 
presumed tumor suppressor activity (Gardiol et al., 1999; Glaunsinger 
et al., 2000; Lee and Laimins, 2004; Lee et al., 2000; Nakagawa and 
Huibregtse, 2000; Nguyen et al., 2003). The E6 and E7 genes from the 
low-risk genital HPV types do not display immortalization activity and 
show impaired interaction with cellular targets, demonstrating that 
these in vitro assays reflect the oncogenic potential of the various HPV 
types in people (Gage et al., 1990; Heck et al., 1992; Schlegel et al., 
1988; Woodworth et al., 1989). 

Even though HPV-immortalized keratinocytes express the viral on- 
cogenes, these cells are not initially tumorigenic in experimental ani- 
mals. However, continued passage of these cells results in the 
emergence of cells that can form tumors in animals, most likely 


134 DANIEL DIMAIO AND JOHN B. LIAO 


due to genetic instability resulting from the sustained action of the 
viral oncogenes (Durst et al., 1983, 1987; Kaur and McDougall, 1989; 
Pecoraro et al., 1991; Pirisi et al., 1987, 1988; Woodworth et al., 1988). 
The E5, E6, and E7 proteins from high-risk HPV eliminate various 
checkpoint control processes, permitting cells to replicate despite 
DNA damage and thereby accumulate mutations (Havre et al., 1995; 
Hickman et al., 1994; Jones and Munger, 1997; Kessis et al., 1993; Pan 
and Griep, 1994; Slebos et al., 1994, 1995; Song et al., 1998; Thomas 
and Laimins, 1998; White et al., 1994; Zhang et al., 2002). In addition, 
HPV16-immortalized keratinocytes display impaired DNA repair fol- 
lowing UV-irradiation (Rey et al., 1999), and the E6 protein can direct- 
ly interfere with DNA repair enzymes (Iftner et al., 2002; Srivenugopal 
and Ali-Osman, 2002). Finally, the E7 proteins perturb mitotic centro- 
somes, which in turn can result in structural and numerical chromo- 
some abnormalities (Duensing et al., 2001a,b), and high-risk HPV E2 
proteins can also cause genomic instability (Bellanger et al., 2005). As 
a consequence of these effects, genetic instability and chromosomal 
abnormalities are hallmarks of HPV-immortalized keratinocytes 
(Montgomery et al., 1995; Oda et al., 1996; Shin et al., 1996; Solinas- 
Toldo et al., 1997; Steenbergen et al., 1998). Some of the resulting 
mutations may activate cellular oncogenes, inactivate tumor suppres- 
sor genes, and disrupt various intracellular control mechanisms, 
thereby facilitating the acquisition of the tumorigenic phenotype (zur 
Hausen, 2000). Similar genetic events appear to take place in cervical 
cells undergoing malignant progression in women persistently infected 
with high-risk HPV (Duensing and Munger, 2004; Heselmeyer et al., 
1997; Hopman et al., 2004; Kisseljov et al., 1996; Matthews et al., 2000; 
Sherwood et al., 2000). 

Because HPV oncogene products function by modulating the activity 
and levels of cellular proteins, it is possible that the host cell genotype 
will directly influence HPV oncogene activity and cancer susceptibility. 
It has been reported that certain normal p53 variants are relatively 
resistant to E6-induced degradation, and that women harboring these 
variants have a reduced risk of developing cervical carcinoma (Storey 
et al., 1998). However, this finding is controversial, perhaps because of 
technical differences in sample acquisition and analysis (Madeleine 
et al., 2000; Makni et al., 2000). 
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Because HPV is a causative agent for cervical carcinoma, it may be 
possible to prevent this cancer with vaccines that block infection (Lowy 
et al., 1994; Stanley, 2002). As noted earlier, the L1 protein is the major 
constituent of the virus particle. Recombinant L1 protein expressed in 
various cell types assembles into particles that resemble authentic 
virus particles recovered from natural lesions (Hagensee et al., 1993; 
Hofmann et al., 1995; Kirnbauer et al., 1992, 1993; Rose et al., 
1993; Zhou et al., 1991). Unlike authentic virions, however, these 
virus-like particles (VLPs) do not contain viral DNA and hence are 
not pathogenic. Inoculation of various animal papillomavirus VLPs 
into experimental animals resulted in the efficient production of anti- 
bodies that recognized conformation-specific epitopes on authentic 
virions and neutralized their infectivity, thereby protecting the 
animals from infection after virus challenge (Breitburd et al., 1995; 
Christensen et al., 1996; De Bruijn et al., 1998; Kirnbauer et al., 1992, 
1996; Nardelli-Haefliger et al., 1997; Rose et al., 1994; Suzich et al., 
1995). On the basis of these results, VLPs composed of the HPV16 or 
the HPV18 L1 protein have been formulated into prophylactic vaccines 
and tested in female human volunteers. These studies demonstrated 
that repeated intramuscular injection of HPV VLPs generates high- 
titer neutralizing antibodies in humans and prevents persistent infec- 
tion by the HPV type(s) in the vaccine (Harper et al., 2004; Harro et al., 
2001; Koutsky et al., 2002; Villa et al., 2005). These vaccines caused a 
marked reduction in the incidence of type-specific CIN, the precursor 
lesion to cervical carcinoma. These studies strongly suggest that pro- 
phylactic vaccination against high-risk HPV types will prevent a 
substantial number of cervical carcinoma cases, although it will take 
many years to see a reduction in cancer incidence because there is 
a large pool of unvaccinated women who have already entered 
the protracted course of cervical carcinogenesis (Goldie et al., 2003). 
Nevertheless, adoption of prophylactic vaccination will cause an im- 
mediate reduction in the number of abnormal Pap smears, with result- 
ing financial savings in patient management (Kulasingam and Myers, 
2003). Prophylactic HPV vaccines will presumably also reduce 
the incidence of other HPV-associated anogenital and oropharyngeal 
cancers. 

Despite the very promising activity of VLP vaccines in the published 
clinical trials, there are numerous issues that have to be addressed 
before prophylactic HPV vaccines find widespread use. First, VLP L1 
vaccines are type-specific, that is, they will only protect against the 
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virus type in the vaccine and possibly closely related types. In clinical 
trials, women developed CIN containing high-risk HPV types not in- 
cluded in the vaccine (Harper et al., 2004). At the moment, the vaccines 
in development contain HPV16 and HPV18 VLPs, the two most com- 
mon high-risk types, which in aggregate account for approximately 
two-thirds of cervical carcinoma. Some vaccine formulations also in- 
clude HPV6 and HPV11, and so may also prevent anogenital warts 
caused by these low-risk types. However, women exposed to most 
other high-risk types will not be protected unless additional HPV types 
are included in the vaccine or strategies are developed to generate 
cross-reactive antibodies. Second, VLPs are relatively expensive to 
produce, distribute, and administer. This may preclude widespread 
use in the developing world, where the vast majority of cervical can- 
cers occur. Thus, efforts are underway to develop less expensive vac- 
cines such as ones that do not require intact VLPs, cold storage, needle 
injection, or multiple doses (Ohlschlager et al., 2003; Yuan et al., 2001). 
Third, it is not known how long protection will last, what the optimal 
age for vaccination is, and whether or not booster inoculations are 
required. Fourth, for maximal effect, girls should be vaccinated before 
they initiate sexual activity, but social, political, or religious considera- 
tions may impede the acceptance of vaccinating girls to prevent what 
may be characterized as a sexually transmitted disease. It is also 
possible that the availability of the vaccine will deter women from 
Pap smear screening. This is a potentially serious concern because 
there will be no protection against high-risk types not in the vaccine. 
The vaccine may also deter pharmaceutical companies from aggres- 
sively pursuing anti-HPV drugs. Finally, prophylactic vaccines will 
be of limited utility to women who are already persistently infected 
with high-risk HPV or whose infected cells have undergone preneo- 
plastic changes. In these women, the presence of infectious virus and 
the infection of additional cells is presumably no longer required for 
carcinogenic progression. Nevertheless, even in infected women, pro- 
phylactic vaccination may provide some benefit by inhibiting the auto- 
infection of neighboring cells or by reducing virus load and hence 
transmission to new hosts. 


VI. THERAPEUTIC VACCINATION AGAINST CERVICAL CARCINOMA 
Most women infected with a high-risk HPV type do not develop 


cervical cancer, and the incidence of cervical cancer is increased in 
immunosuppressed women, including those with AIDS (Franceschi 
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et al., 1998; Frisch et al., 2000; Goldie et al., 2003; Halpert et al., 1986; 
Palefsky and Holly, 2003; Petry et al., 1994; Serraino et al., 1999). 
These findings suggest that cell-mediated immunity plays a major role 
in clearing cervical HPV infections and controlling precancerous le- 
sions. In addition, studies in women with cervical neoplasia demon- 
strated the existence of E6- and E7-specific T cells in these patients, 
but the role of these cells in tumor surveillance is unclear (Bontkes 
et al., 2000; Eiben et al., 2002; Evans et al., 1997; Kadish et al., 1997, 
2002; Steele et al., 2005). Nevertheless, measures that stimulate cellu- 
lar immunity to HPV proteins may be therapeutically useful. Accord- 
ingly, an alternative vaccine strategy called therapeutic vaccination is 
also under development, with the goal of treating preneoplastic lesions 
or even advanced cancer by generating cytotoxic T lymphocytes (CTLs) 
that destroy cervical cancer and precancer cells, which express the 
HPV E6 and E7 proteins (Ressing et al., 1996; Stanley, 2002). Although 
these proteins function in the nucleus, proteolytic fragments are dis- 
played at the surface of infected cells in association with major histo- 
compatibility complex molecules, where they can serve as immune 
targets. Furthermore, as described in the next section, continuous 
expression of these viral proteins is required for the sustained survival 
and proliferation of the cancer cells, so the cells are unlikely to escape 
the immune response simply by losing expression of the viral proteins. 

In mouse models, various therapeutic vaccination strategies can 
elicit a cytolytic CD8* T cell response directed against high-risk HPV 
E7 and/or E6 proteins, resulting in impaired growth or elimination of 
transplantable tumors expressing these viral antigens (Baldwin et al., 
2003; Borysiewicz et al., 1996; Chu et al., 2000; Feltkamp et al., 1993; 
Hariharan et al., 1998; Lin et al., 1996; Velders et al., 2001). However, 
so far this approach has shown less impressive results in human 
trials against CIN and cervical carcinoma (Borysiewicz et al., 1996; 
Muderspach et al., 2000; van Driel et al., 1999). It is possible 
that cervical cancer cells may not be good targets for CTL killing 
because they express low levels of required major histocompatibility 
complex class I molecules (Bontkes et al., 1998; Connor and Stern, 
1990; Koopman et al., 2000). In addition, the concentration of the viral 
proteins in human cancer cells may not be sufficient to elicit a robust 
immune response (Azoury-Ziadeh et al., 2001), or the viral proteins 
may induce a state of T-cell tolerance (Borchers e¢ al., 1999). There are 
numerous reports that papillomavirus E5, E6, and E7 proteins 
can impair antigen presentation and other aspects of the innate or 
acquired immune response (Ashrafi et al., 2002, 2005; Barnard and 
McMillan, 1999; Lee et al., 2001; Marchetti et al., 2002; Park et al., 
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2000; Perea et al., 2000; Ronco et al., 1998; Zhang et al., 2003). 
Moreover, skin transplantation studies in E7 transgenic mice suggest 
that epithelial cells expressing the E7 protein are not good 
targets for cytotoxic T cells in the absence of other immune stimulation 
(Dunn et al., 1997; Melero et al., 1997). Approaches to overcome these 
obstacles are likely to be required before therapeutic vaccination is 
successful. 

Unlike prophylactic vaccination, which would strive to achieve full 
coverage of susceptible populations, therapeutic vaccines would be 
reserved for people who already have persistent HPV infections or 
cancer. This consideration will influence the testing, cost, and distri- 
bution of therapeutic vaccines. Attempts are also underway to develop 
combined prophylactic/therapeutic chimeric VLP vaccines that simul- 
taneously raise neutralizing antibodies to L1 and CTLs to E6 or E7 
(Greenstone et al., 1998; Kaufmann et al., 2001). 


VII. Prospects FoR ANTIVIRAL TREATMENTS OF CERVICAL CARCINOMA 


The sustained expression of HPV oncogene products in cervical can- 
cer cells suggests that these viral proteins may be required for the 
survival and proliferation of the cells. To test this possibility, a number 
of approaches have been used to inhibit the expression or activity of 
the E6 or E7 proteins in cervical cancer cell lines, including approaches 
that reduce the amount of viral RNA in cells or, less commonly, that 
interfere with the action of the viral proteins. Antisense RNA, RNA 
interference, or ribozymes have been used to degrade viral RNA, and 
transcriptional repressors have been used to inhibit transcription of 
the viral genes. Small proteins that bind and inhibit the E6 or E7 
proteins have also been used. These experiments showed that inhibi- 
tion of HPV function by a variety of methods caused either growth 
arrest or apoptosis in cervical carcinoma cell lines (Alvarez-Salas et al., 
2003; Steele et al., 1992; Storey et al., 1991; von Knebel Doeberitz et al., 
1988). The most dramatic effects were obtained after introduction of 
the bovine papillomavirus E2 transcription factor, which binds to 
the E6/E7 promoter in the integrated HPV genome and efficiently 
represses expression of the viral oncogenes (Desaintes et al., 1997; 
Dowhanick et al., 1995; Hwang et al., 1993). E6/E7 repression results 
in the reactivation of the Rb and p53 tumor suppressor pathways and 
in the rapid acquisition of a growth arrested state that closely resem- 
bles replicative senescence achieved by primary cells after they have 
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reached the end of their normal lifespan during serial passage in 
culture (Francis et al., 2000; Goodwin and DiMaio, 2000; Goodwin 
et al., 2000; Hwang et al., 1996; Wells et al., 2000; Wu et al., 2000). 
Further genetic analysis indicated that E6 expression is continuously 
required for p53 inactivation in cervical cancer cell lines, and that E7 
expression is continuously required for Rb inactivation (DeFilippis 
et al., 2003). Several experiments indicated that Rb activation is pri- 
marily responsible for induced senescence following E7 repression 
(DeFilippis et al., 2003; Psyrri et al., 2004; Wells et al., 2000). 

The continuous requirement for viral protein function in cervical 
cancer cells implies that the cellular mutations that accumulate dur- 
ing carcinogenic progression are not sufficient to maintain these cells 
in a proliferative state and suggests that agents that interfere with the 
expression of the HPV genome or block the interaction of E6 or E7 with 
their cellular targets may restore tumor suppressor function and in- 
hibit the growth or survival of the cells (Sterlinko Grm and Banks, 
2004). Because E7 has proapoptotic activity that can be blocked by the 
E6 protein (Pan and Griep, 1994; Stoppler et al., 1998), interventions 
that specifically block E6 function may be particularly useful by induc- 
ing apoptosis triggered by ongoing E7 expression (Butz et al., 2000, 
2003; Horner et al., 2004). Agents that inhibit the function of E1 or E2 
are not likely to be useful in most cases of cancer because these 
proteins are often lost in the cancers and the integrated viral DNA 
replicates passively in the cellular chromosomes. The absence of 
viral genomes and proteins from uninfected normal cells suggests that 
it will be possible to develop highly specific antiviral agents with 
relatively minor side effects. 


VIII. CONCLUSIONS AND PROSPECTS 


Epidemiological, laboratory, and clinical studies have established 
that high-risk HPVs play an obligatory role in the genesis of cervical 
carcinoma. Molecular and cell biological studies have revealed the basic 
features of the viral life cycle and provided detailed understanding of 
the effects of viral proteins on virus propagation and on the cellular 
phenotype. These advances have provided the information necessary to 
develop several promising strategies to combat papillomavirus infec- 
tion or treat HPV-associated cancer. We can envision a future where 
most cases of cervical carcinoma are prevented by prophylactic vaccina- 
tion, screening and early intervention, or successfully treated by thera- 
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peutic vaccination or nontoxic antiviral agents (Schiffman and Castle, 
2005). This optimistic scenario is a direct outcome of our understanding 
of the basic virology of cervical carcinogenesis. 
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ABSTRACT 


Viral infection of plants is a complex process whereby the virus 
parasitizes the host and utilizes its cellular machinery to multiply 
and spread. In turn, plants have evolved signaling mechanisms that 
ultimately limit the ingress and spread of viral pathogens, resulting in 
resistance. By dissecting the interaction between host and virus, 
knowledge of signaling pathways that are deployed for resistance 
against these pathogens has been gained. Advances in this area have 
shown that resistance signaling against viruses does not follow a 
prototypic pathway but rather different host factors may play a role 
in resistance to different viral pathogens. Some components of viral 
resistance signaling pathways also appear to be conserved with those 
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functioning in signaling pathways operational against other nonviral 
pathogens, however, these pathways may or may not overlap. 
This review aims to document the advances that have improved our 
understanding of plant resistance to viruses. 


I. INTRODUCTION 


Plant viruses are a major threat to the cultivation of many agricul- 
tural crops, causing serious economic losses worldwide. For many 
years, these pathogens have been controlled by conventional plant 
protection methods such as crop rotation, destruction of infected 
sources, breeding for resistance, and chemical control of relevant 
insect vectors (Bos, 2000; Hull, 2002). 

Most plant species are resistant to most plant viruses, which suggests 
that susceptibility to viral pathogens is the exception rather than the 
rule. Resistance may be due to a variety of factors. For example, the 
plant may lack a factor(s) that is required for viral replication and/or 
movement. Alternatively, the plant’s basal defenses may be sufficient 
to prevent viral replication and spread. The above two forms of resis- 
tance are often grouped under a phenomenon known as nonhost resis- 
tance. Basal defenses may include silencing of the viral genetic 
material either by degradation of the viral RNA or via inhibition of 
its translation (Dunoyer and Voinnet, 2005). RNA silencing has been 
an active area of research over the past decade, and the results are 
covered in several excellent reviews (Baulcombe, 2004; Dunoyer and 
Voinnet, 2005; Qu and Morris, 2005). Hence, it is not the focus of this 
chapter and will be only briefly discussed near the end of the chapter 
for completeness. 

In comparison to nonhost resistance, some plants prevent viral 
infection by activating defense responses following pathogen recogni- 
tion. This process involves strain-specific recognition of a virus- 
encoded avirulence (Avr) factor through direct or indirect interaction 
with the corresponding host resistance (R) gene product. Since resis- 
tance is dependent on the presence of both the pathogen Avr factor and 
its corresponding host R protein, it is termed gene-for-gene-mediated 
resistance (Flor, 1971). Such an interaction (also known as an incom- 
patible interaction) triggers one or more defense signaling cascades 
and is often associated with induction of a hypersensitive response 
(HR). The HR is a form of programmed cell death, which results in 
the formation of necrotic lesions due to death of plant cells at and 
around the site of pathogen entry. An HR is one of the first visible 
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manifestations of pathogen-induced host defense responses and it is 
thought to help prevent pathogen multiplication and spread by confin- 
ing the pathogen to the dead cells. However, increasing evidence sug- 
gests that the HR is not a prerequisite for gene-for-gene-mediated 
disease resistance. For example, a visible HR does not develop either 
in potato during Rx-mediated resistance against Potato virus X (PVX) 
(Kohm et al., 1993) or in Arabidopsis plants overexpressing the R gene 
HRT, which confers an HR to Turnip crinkle virus (TCV). Nonetheless, 
these plants are resistant to their respective viral pathogens (Chandra- 
Shekara et al., 2004; Cooley et al., 2000). Moreover, HR and resistance 
can be uncoupled in the Arabidopsis-TCV pathosystem; susceptible 
plants show similar amounts of virus in their systemic tissue irrespec- 
tive of the presence or absence of an HR on their inoculated leaves. This 
result suggests that the HR by itself is not sufficient to prevent viral 
spread to the uninoculated tissues. Thus, while an HR is frequently 
associated with resistance, it is not necessarily required or sufficient for 
resistance. 

R gene-mediated pathogen recognition often involves the accumula- 
tion of various phytohormones, including salicylic acid (SA), jasmonic 
acid (JA), and/or ethylene, which in turn signal the activation of defense 
gene expression. These phytohormones trigger separate and/or overlap- 
ping pathways whose combined effects confer resistance and prevent 
spread of the pathogen to the uninoculated parts of the plant. 

R genes cloned to date can be grouped into six classes, based on their 
structural characteristics and protein motifs (Martin et al., 2003). 
These include proteins with a serine/threonine kinase catalytic domain 
and an N-terminal myristylation motif (class I, for example, Pto in 
tomato), proteins with leucine-rich repeats (LRRs), a nucleotide bind- 
ing site (NBS), and an N-terminal coiled coil (CC) sequence (class II, 
for example, HRT in Arabidopsis, and Xal in rice), proteins similar to 
those in class II except for the presence of a Toll-interleukin-1 receptor 
(TIR)-like domain instead of a CC domain at their N-terminus (class III, 
for example, N in tobacco, L in flax, and RPP5 in Arabidopsis), proteins 
that have a transmembrane (TM) domain, an extracellular LRR and a 
small putative cytoplasmic tail without any identifiable motifs (class IV, 
for example, Cf2 and Cf9 in tomato), and proteins that contain an 
extracellular LRR, a TM domain, and a cytoplasmic serine/threonine 
kinase domain (class V, for example, Xa21 in rice). Class VI is composed 
of R proteins that do not show characteristics similar to proteins belong- 
ing to any of the above classes (for example, RPW8 in Arabidopsis and 
Vel and Ve2 in tomato). The proteins belonging to classes I-III lack a 
TM domain and are therefore thought to be localized intracellularly. 
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Strikingly, most of the R genes cloned thus far belong either to class II or 
HI, suggesting that NBS-LRR type R proteins predominate over 
the other classes. It is, therefore, no particular surprise that a majority 
of the cloned R genes specifying resistance to viral pathogens also 
belong to the NBS-LRR category (Table I). Analysis of downstream 
signaling reveals that a few viral R genes activate resistance via unique 
pathways not used by other R genes. By contrast, other viral R genes 
utilize pathways in which at least some of the downstream components 
are shared, in many cases these components are also common to path- 
ways triggered by R genes specifying resistance to nonviral pathogens. 

Host resistance to viral pathogens is genetically accessible and, 
therefore, has been used widely in crop breeding. This approach con- 
tinues to be a cost effective and commercially feasible method for 
providing protection against viral pathogens (Lecoq et al., 2004). 
Molecular cloning of R genes has not only aided in crop breeding but also 
has facilitated investigation of the mechanisms underlying resistance to 
viruses. However, our understanding of resistance signaling is far from 
clear and, certainly, more work is needed to elucidate the complex signal- 
ing cascades leading to resistance. This review covers various aspects of 
resistance signaling employed by plants to counter viral pathogens and 
details several R gene-specified pathways operational against different 
viral pathogens. 


II. R GeNE-MEDIATED RESISTANCE TO VIRAL PATHOGENS 


A. N Gene-Mediated Resistance to Tobacco mosaic virus in Tobacco 


The interaction between Tobacco mosaic virus (TMV) and tobacco 
plants containing the N gene is a classic example of gene-for-gene 
resistance (Holmes, 1938). The N gene encodes a TIR-NBS-LRR protein 
(Whitham et al., 1994) that interacts directly or indirectly with the 
helicase domain of the TMV replicase (Erickson et al., 1999; Padgett 
et al., 1997). N-mediated recognition of TMV leads to HR development 
in the host, which helps restrict the virus to a region in and immediately 
surrounding the necrotic lesions. By contrast, tobacco plants lacking 
the N gene fail to develop an HR and allow systemic spread of TMV, 
resulting in mosaic disease symptoms characterized by intermingled 
areas of light and dark green leaf tissue. Upon TMV inoculation, N-gene 
expression is upregulated in both the inoculated and uninoculated 
tissues of resistant plants (Levy et al., 2004). The approximately 
fourfold increase in N transcripts in uninoculated tissues suggests that 


TABLE I 


List oF DOMINANT R GENES AGAINST PLANT VIRUSES, THEIR PROTEIN STRUCTURE, AND CORRESPONDING AVIRULENCE FACTORS 





R gene Host* Pathogen* Avr factor* Predicted domains’ Reference 

N N. tabacum TMV Helicase TIR-NBS-LRR Whitham et al., 1994 
HRT A. thaliana TCV CP CC-NBS-LRR Cooley et al., 2000 

RCY1 A. thaliana CMV CP CC-NBS-LRR Takahashi et al., 2002 
RTM1 A. thaliana TEV -1 Jacaline-like Chisholm eż al., 2000 
RTM2 A. thaliana TEV — Small HSP-like Whitham et al., 2000 
Rx1 S. tuberosum PVX CP CC-NBS-LRR Bendahmane et al., 1999 
Rx2 S. tuberosum PVX CP CC-NBS-LRR Bendahmane et al., 2000 
Nb S. tuberosum PVX MP — Marano et al., 2002 

NYtor S. tuberosum PVY — — Celebi-Toprak et al., 2002 
Ry S. tuberosum PVY NIaPro — Mestre et al., 2003 

Sw-5 L. esculentum TSWV — CC-NBS-LRR Brommonschenkel et al., 2000 
Tm-1 L. esculentum ToMV RdRp — Meshi et al., 1988 

Tm-2 L. esculentum ToMV MP CC-NBS-LRR Lanfermeijer et al., 2005 
Tm-2? L. esculentum ToMV MP CC-NBS-LRR Lanfermeijer et al., 2003 
Rsv1 G. max SMV HC-Pro and P3 CC-NBS-LRR Hayes et al., 2004 
TuRBO1 B. napus TuMV CI - Jenner et al., 2000 
TuRBO1b B. napus TuMV CI - Jenner et al., 2000 
TuRBO3 B. napus TuMV P3 - Hughes et al., 2003 
RuRBO4 B. napus TuMV P3 - Jenner et al., 2002 
RuRBOS B. napus TuMV CI - Jenner et al., 2002 

LL L?, L4 C. annum TMV CP — Berzal-Herranz et al., 1995 


* A. thaliana, Arabidopsis thaliana; N. tabacum, Nicotiana tabacum; S. tuberosum, Solanum tuberosum; L. esculentum, Lycopersi- 
cum esculentum; G. max, Glycine max; B. napus, Brassica napus; C. annum, Capsicum annum. 
t TCV, Turnip crinkle virus; CMV, Cucumber mosaic virus; TMV, Tobacco mosaic virus; PVX, Potato virus X; PVY, Potato virus Y, TSWV, 
Tomato spotted wilt virus; ToMV, Tomato mosaic virus; SMV, Soybean mosaic virus; TEV, Tobacco etch virus; TuMV, Turnip mosaic virus. 
t CP, coat protein; MP, movement protein; RdRp, RNA dependent RNA polymerase; CI, cylindrical inclusion; P3, protein 3; NIaPro, 


protease domain of nuclear inclusion; HC-Pro, helper component protease. 


$ TIR-NBS-LRR, Toll-interleukin-1 receptor-nucleotide binding site-leucine-rich repeat; CC-NBS-LRR, coiled coil-nucleotide binding 
site-leucine-rich repeat; HSP, heat shock protein. 


1 Not known. 
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a TMV-induced systemic signal is involved in induction of N gene 
expression and resistance to TMV (Levy et al., 2004). Similar to a few 
other TIR-NBS-LRR R genes (Ayliffe et al., 1999; Parker et al., 1997), N 
transcripts undergo alternative splicing to generate two mRNAs, de- 
signated as Ng and N, (Dinesh-Kumar and Baker, 2000). Ns and Ny, are 
predicted to encode full-length and truncated N proteins, respectively 
(Dinesh-Kumar and Baker, 2000); the relative levels of these tran- 
scripts change after TMV infection. Ng transcript levels are higher 
through 3 h after TMV infection while the N, transcript predominates 
4-8 h after infection. Notably, both transcripts are required to attain 
resistance to TMV (Dinesh-Kumar and Baker, 2000). The molecular 
mechanism(s) regulating alternative splicing is not known, but it has 
been suggested that protein kinases involved in the N-TMV interaction 
or rapid cellular changes function as a trigger to induce alternative 
splicing (Dinesh-Kumar and Baker, 2000). 

N-mediated resistance to TMV is dependent on SA, and increased SA 
levels are observed during the HR to TMV in both inoculated and 
systemic tissues. These increases correlate with the induction of 
pathogenesis-related (PR) genes in both tissues (Delaney et al., 1994; 
Malamy et al., 1992). Studies have suggested that nitric oxide (NO) 
also plays a role in signaling TMV resistance. NO synthase (NOS) 
activity is induced by TMV infection in an N gene-dependent manner 
(Durner et al., 1998). Furthermore, exogenous application of NO or an 
NO donor induces SA accumulation and triggers PR-1 and phenylala- 
nine ammonia lyase gene expression. Conversely, exogenous applica- 
tion of NOS inhibitors abolishes TMV-induced PR-1 expression in an N 
gene-dependent manner (Durner et al., 1998). SA appears to be neces- 
sary for NO function, since exogenous application of NO donors failed 
to induce PR-1 in SA-deficient nahG transgenic tobacco (Durner et al., 
1998). SA also has been shown to induce the expression of a tobacco 
RNA-dependent RNA polymerase (RdRP) gene; RdRPs have been im- 
plicated in RNA silencing and in limiting the accumulation and spread 
of RNA viruses (Xie et al., 2001). In addition, increased SA levels can 
contribute to N-mediated resistance by inhibiting the respiratory elec- 
tron transport chain, leading to an increase in mitochondrial reactive 
oxygen species (Chivasa and Carr, 1998; Gilliland et al., 2003; Singh 
et al., 2004). 

The signaling pathway leading to N-mediated resistance also has 
been shown to involve polyamine metabolism. The HR to TMV is 
associated with an ~20-fold increase in spermine levels in the inocu- 
lated leaves (Yamakawa et al., 1998). Moreover, exogenous treatment 
with spermine induces expression of acidic PR genes and reduces the 
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size of TMV-induced lesions. However, spermine treatment does not 
induce SA accumulation and, conversely, exogenous application of SA 
does not enhance spermine accumulation (Yamakawa et al., 1998). 
Spermine-derived signaling involves a Cys2/His2-type zinc finger 
protein (ZFT1) that is upregulated during TMV infection in an 
N-dependent manner. TMV-induced expression of ZFT1 remains un- 
impaired in nahG transgenic tobacco plants, indicating that spermine 
signaling involves an SA-independent pathway. It has been suggested 
that ZFT1 functions as a transcriptional repressor for spermine sig- 
naling and it may restrict the virus by accelerating necrotic lesion 
formation (Uehara et al., 2005). Consistent with this possibility, over- 
expression of ZFT1 resulted in enhanced tolerance to TMV (Uehara 
et al., 2005). 

Characterization of N-mediated signaling has led to the identifi- 
cation of several proteins that play important roles in resistance 
responses to TMV (Fig. 1). Remarkably, several of these downstream 
components were first identified in Arabidopsis as signal transducers 
for pathways leading to resistance against nonviral pathogens. This 
finding suggests that resistance signaling components are conserved 
not only across plant species but also for different types of pathogens. 
In tobacco, N-mediated resistance is dependent on RAR1, SGT1, and 
NPR1-like proteins (Liu et al., 2002a). RAR1 and SGT1 can participate 
in resistance pathways triggered by either TIR- or CC-NBS-LRR-type 
R proteins (Aarts et al., 1998; Austin et al., 2002; Azevedo et al., 2002; 
Liu et al., 2002a; Muskett et al., 2002; Tor et al., 2002). Both RAR1 and 
SGT1 physically interact with each other, as well as with other pro- 
teins (Azevedo et al., 2002; Liu et al., 2002a). SGT1 is a component of 
the SCF-type E3 ubiquitin ligase complex (Kitagawa et al., 1999; 
Lyapina et al., 2001; Matsuzawa and Reed, 2001), which facilitates 
the attachment of ubiquitin to specific protein substrates for 
subsequent degradation by the 26S proteasome (Deshaies, 1999). In 
addition to RAR1, SGT1 also interacts with SKP1, which is another 
component of the SCF-type E3 ubiquitin ligase complex (Liu et al., 
2002b). Silencing of SKP1 results in loss of resistance to TMV, and the 
effect is similar to that seen upon silencing N (Liu et al., 2002b). Both 
RAR1 and SGT1 are also known to interact with the COP9 signalo- 
some, which is an evolutionarily conserved multiprotein complex 
involved in protein degradation and light signaling (Wei and Deng, 
1999; Wei et al., 1998). The COP9 multiprotein complex can, in turn, 
interact with the components of the SCF-type 3 ubiquitin ligase com- 
plex (Lyapina et al., 2001), and silencing of the CSN3 and CSN8 
subunits of the COP9 complex abolishes N-mediated resistance to 
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Fic 1. A sketch of components involved in the N- (A), HRT- (B), and RCY1- (C) 
mediated resistance-signaling pathways, which confer resistance to TMV in tobacco 
and to TCV and CMV in Arabidopsis, respectively. Defense signaling in each of these R 
protein-triggered pathways is initiated upon indirect interaction between the R protein 
and its corresponding Avr factor encoded by the virus. Thick lines in (A) and (B) indicate 
light-mediated regulation of defense pathways. NO-mediated signaling plays a positive 
role in N-mediated resistance to TMV, but it is not yet clear if NO acts upstream or 
downstream of EDS1 or the SGT1-RAR1 complex. Dash-dotted lines in (B) indicate SA- 
mediated feedback regulation of EDS1 and PAD4. A positive feedback mechanism also 
regulates SA-mediated induction of HRT. The dashed line in (B) indicates resistance 
triggered by overexpression of HRT, which appears to act independently of SA. Dashed 
lines in (C) indicate partial dependence of RCY1-mediated signaling on the SA and 
ethylene pathways. 


TMV (Liu et al., 2002b). It is possible that the COP9 signalosome 
modulates resistance by affecting the functions of its interacting part- 
ners or via its role in light signaling. The latter possibility is supported 
by the observation that HR formation and TMV resistance are com- 
promised in the dark (Chandra-Shekara et al., 2006). 

Signal transduction via the N protein also involves heat shock pro- 
tein 90 (HSP90). HSP90 interacts with N, and silencing of HSP90 
results in loss of resistance to TMV (Liu et al., 2004a). Heat shock 
proteins are known to play roles in the folding, activation, and assem- 
bly of proteins involved in signal transduction, cell cycle control, or 
transcriptional regulation; they may serve a similar function in 
N-mediated signaling (Picard, 2002; Richter and Buchner, 2001). Since 
HSP90 interacts with SGT1 and RAR1, these proteins, together with N, 
may exist as a complex (Liu et al., 2004a). Like RAR1 and SGT1, HSP90 


PLANT DEFENSE AGAINST VIRUSES 169 


functions in other R gene-signaling pathways. For example, all three 
proteins are required for the function of RPS2, which mediates resis- 
tance to Pseudomonas syringae pv. tomato DC3000 in Arabidopsis 
(Takahashi et al., 2003). 

A CC-NBS-LRR class resistance protein, N requirement gene 1 
(NRG1), was shown to play a critical role in N-mediated resistance 
(Peart et al., 2005). In tobacco containing the N gene, silencing of 
NRG1 resulted in loss of HR and resistance to TMV. This suggests 
that resistance to TMV requires both TIR-NBS-LRR and CC-NBS-LRR 
classes of resistance proteins (Peart et al., 2005), and that NRG1 may 
be acting downstream of or in parallel to the N signaling pathway. 
N-mediated signaling, but not NRG1-mediated signaling, is dependent 
on EDS1, whereas both N- and NRG1-mediated pathways are depen- 
dent on SGT1 (Peart et al., 2005). Results obtained from other systems 
also provide compelling evidence that multiple NBS-LRR proteins are 
a common feature of resistance signaling pathways. For example, 
RPP2-mediated resistance against Peronospora parasitica requires 
two NBS-LRR proteins encoded by the adjacent genes RPP2a and 
RPP2b (Sinapidou et al., 2004). 

N-mediated resistance to TMV is also associated with the activation 
of several mitogen-activated protein kinases (MAPKs), including the 
SA-induced protein kinase (SIPK), the wound-induced protein kinase 
(WIPK), and an MAPKK, NtMEK2, which acts as an upstream kinase of 
SIPK and WIPK. These kinases also are activated by various elicitors, 
suggesting that they function as convergence points after the percep- 
tion of different pathogens, pathogen-derived elicitors, and nonbiotic 
stresses (Cardinale et al., 2000; Desikan et al., 2001; Jin et al., 2008; 
Lee et al., 2001a; Mikolajezyk et al., 2000; Niihse et al., 2000; Romeis 
et al., 1999; Zhang and Klessig, 1998a,b; Zhang et al., 1998, 2000). 
Since TMV resistance is compromised in tobacco silenced for 
expression of NtMEK2, SIPK, or WIPK, these kinases appear to play 
important roles in N-mediated signaling (Jin et al., 2003; Liu et al., 
2004b). 


B. HRT-Mediated Resistance to Turnip crinkle virus in Arabidopsis 


The carmovirus TCV has a wide host range that includes several 
members of the cruciferous family such as Arabidopsis. Most Arabi- 
dopsis ecotypes are susceptible to TCV; however, a resistant line, 
designated Di-17 was isolated from the Dijon (Di) ecotype (Dempsey 
et al., 1993). Following TCV infection, Di-17 plants develop an HR, 
express several defense genes, including PR-1, PR-2, PR-5, and GST1, 
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and accumulate SA and the phytoalexin, camelexin (Dempsey et al., 
1997; Kachroo et al., 2000). By contrast, plants lacking HRT fail to 
develop an HR after TCV infection, they also exhibit basal-level 
expression of the PR and GST1 genes and accumulate low levels of 
SA and camelexin. These plants allow systemic spread of the virus, 
which is associated with appearance of crinkled leaves and drooping 
bolts followed by death of the plant (Dempsey et al., 1997; Kachroo 
et al., 2000). The HR to TCV is conferred by a CC-NBS-LRR type 
R protein, designated HRT (hypersensitive response to TCV) (Cooley 
et al., 2000). In addition to its role in HR development, HRT is required 
for resistance to TCV. However, HRT alone is not sufficient to 
confer resistance to TCV, as F1 plants and ~75% of HR-developing F2 
plants derived from a cross between resistant (Di-17) and susceptible 
(Columbia-0 [Col-0]) ecotypes succumb to disease. Furthermore, ~90% 
of transgenic Col-0 plants expressing an HRT transgene develop an 
HR but are still susceptible, confirming that HRT alone is insufficient 
and that other loci also may regulate resistance to TCV (Cooley et al., 
2000). Subsequent studies revealed that a recessive allele at a second 
locus, designated RRT (regulates resistance to TCV), is required to- 
gether with HRT to confer resistance (Chandra-Shekara et al., 2004; 
Kachroo et al., 2000). Analysis of HRT RRT homozygous plants sug- 
gests that RRT does not have a role in HR development or PR gene 
induction (Chandra-Shekara et al., 2004). 

Analysis of the downstream signaling pathway indicates that HRT- 
mediated resistance is independent of JA and ethylene but dependent 
on SA (Fig. 1; Chandra-Shekara et al., 2004; Kachroo et al., 2000). TCV 
induces approximately a 10-fold increase in SA levels in Di-17 but 
not in Col-0. This increase appears to be critical for signaling resis- 
tance since SA-deficient plants expressing the nahG transgene, which 
encodes the SA-degrading enzyme salicylate hydroxylase, are com- 
promised for HRT-mediated resistance. Similarly, mutations in the 
EDS1, EDS5, PAD4, or SID2 genes, which suppress SA accumula- 
tion following TCV infection, compromise HRT-mediated resistance 
(Chandra-Shekara et al., 2004). It has previously been demonstrated 
that mutations in EDS1, EDS5, PAD4, or SID2 enhance susceptibility 
to various bacterial and oomycete pathogens by affecting SA percep- 
tion or accumulation (Falk et al., 1999; Jirage et al., 1999; Nawrath 
et al., 2002; Wildermuth et al., 2001). The SA level is affected not only 
by the gene products directly involved in SA synthesis [such as SID2, 
which encodes isochorismate synthase (Wildermuth et al., 2001)] but 
also by factors that participate in resistance signaling, and directly or 
indirectly affect SA synthesis and/or accumulation (such as EDS1, 
PAD4, EDS5). For example, mutations in EDS1 and PAD4 compromise 
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SA synthesis and cause increased susceptibility to bacterial and oomy- 
cete pathogens (Falk et al., 1999; Jirage et al., 1999). The eds1 and 
pad4 mutations also block the pathogen-activated expression of EDS5, 
which encodes another component of the SA-signaling pathway. 

HRT-mediated signaling, unlike most CC-NBS-LRR R proteins, is 
dependent on EDS1 but independent of NDR1. This finding is unexpect- 
ed since R proteins with a CC-NBS-LRR structure, such as HRT, 
usually require NDR1 to signal resistance responses, while R proteins 
with a TIR-NBS-LRR structure utilize EDS1 (Aarts et al., 1998; Dangl 
and Jones, 2001). Besides HRT, the only other CC domain-containing 
R protein that utilizes an EDS1-dependent pathway is RPW8, which 
confers broad-spectrum resistance to powdery mildew (Xiao et al., 
2005). However, RPW8 is not an NBS-LRR type R protein; it contains 
an N-terminal TM domain in addition to the CC domain. The HRT 
paralog RPP8 found in the Landsberg ecotype requires neither NDR1 
nor EDS1 for its downstream signaling (McDowell et al., 2000). It is 
possible that EDS1 plays a signaling role in HRT-mediated resistance 
and/or has an effect on resistance to TCV via its effect on SA levels, 
which appear to be critical for resistance (Chandra-Shekara et al., 
2004). In contrast, HRT-mediated resistance is independent of two 
other downstream factors RAR1 and SGT1b. 

HRT-mediated resistance is dependent on PAD4, which like EDS1 
and EDSS5, is involved in regulating SA levels. PAD4 also regulates 
SA-induced expression of HRT, which suggests that besides governing 
SA levels, PAD4 has other regulatory functions in the HRT-triggered 
pathway. Both PAD4 and EDS1 show homology to triacylglycerol 
lipases/esterases. This raises the possibility that lipid/fatty acid signal- 
ing is involved in resistance to viral pathogens. A role for lipids/fatty 
acids in resistance to viral pathogens also is suggested by the finding 
that a mutation in the yeast delta-9 desaturase impairs replication of 
Brome mosaic virus (BMV) (Lee et al., 2001). Since delta-9 desaturase 
catalyzes the formation of oleic acid, the level of oleic acid may influ- 
ence BMV replication. Replication of BMV is much more sensitive than 
yeast growth to reduced levels of this unsaturated fatty acid (Lee et al., 
2001b). In plants, an equivalent of the yeast delta-9 desaturase is 
stearoyl-acyl carrier protein desaturase; similar to the yeast enzyme, 
a mutation in the plant desaturase lowers oleic acid levels (Kachroo 
et al., 2001). Analysis of a stearoyl-acyl carrier protein desaturase- 
defective mutant, designated ssi2, showed that it is partially resistant 
to Cucumber mosaic virus (CMV) (Takahashi et al., 2004). Second-site 
mutations, which restore oleic acid levels in ssi2 plants, were sufficient 
to restore CMV susceptibility. Thus, ssi2-mediated resistance to CMV 
is likely due to reduced levels of oleic acid. Oleic acid also has been 
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implicated in regulating other defense responses (Kachroo et al., 2001, 
2003a,b, 2004, 2005). However, oleic acid levels do not appear to be 
critical for TCV resistance since ssi2 plants are susceptible to this 
pathogen (Chandra-Shekara et al., 2004). 

Analysis of the HRT signaling pathway has revealed that light con- 
ditions also influence TCV resistance. We previously demonstrated 
that high SA levels, due to exogenously supplied SA or the presence 
of a mutation stimulating endogenous SA accumulation, enhance TCV 
resistance in an HRT-dependent, rrt-independent manner by upregu- 
lating HRT expression (Chandra-Shekara et al., 2004, 2006). Light 
plays an important role in the SA-mediated upregulation of HRT 
expression and TCV resistance. When HRT-overexpressing transgenic 
Col-0 plants were subjected to a dark period immediately following 
TCV inoculation, TCV resistance was abolished (Chandra-Shekara 
et al., 2006). Strikingly, dark treatment impaired resistance without 
affecting the levels of HRT transcript or free SA. However, increased 
expression of HRT, coupled with high endogenous SA levels, partially 
overcame the light requirement (Chandra-Shekara et al., 2006). 

Light also is required for the HR to TCV, since a dark period imme- 
diately following inoculation abolishes both the HR and PR-1 gene 
expression (Chandra-Shekara et al., 2006). Unlike resistance, the HR 
to TCV is independent of EDS1, PAD4, EDS5, and SID2. This suggests 
that, upon recognition of the pathogen, HRT induces at least two 
distinct pathways that differ in their requirement for downstream 
factors (Fig. 1). Since SA levels in HRT sid2 plants do not rise after 
TCV inoculation, both HR formation and PR-1 expression appear to be 
SA independent. In this regard, the HRT-triggered signaling pathway 
may overlap with that of its RPP8 paralog, which confers resistance to 
P. parasitica in an SA-independent manner (McDowell et al., 1998). 
TCV-induced HR and PR-1 gene expression also are independent of 
NDR1, RAR1, and SGT1. A mutation in NPR1, a positive regulator of 
SA signaling and systemic acquired resistance, partially compromises 
TCV-induced PR-1 gene expression but does not affect the HR or 
resistance (Kachroo, 2005; Kachroo et al., 2000). 


C. RCY1-Mediated Resistance to Cucumber mosaic virus 
in Arabidopsis 


The interaction between Arabidopsis ecotype C24 and CMV, a cucu- 
movirus, represents the only other Arabidopsis—virus pathosystem, 
besides the Arabidopsis—TCV system, in which viral resistance is the 
result of pathogen-induced defense responses. Inoculation of CMV on 
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plants from the resistant ecotype C24 elicits an HR on the inoculated 
leaves, which is accompanied by increased expression of the SA- 
responsive genes, PR-1 and PR-5 (Takahashi et al., 1994, 2004). Unlike 
the Arabidopsis—TCV system, the HR to CMV is associated not only 
with the induction of SA-responsive genes, but also with upregulation of 
the JA/ethylene-responsive gene PDF1.2 (encoding a plant defensin 
protein). This finding suggests that both SA- and JA/ethylene-regulated 
pathways are triggered during the resistance response to CMV. Con- 
sistent with this possibility, CMV-induced expression of the PR and 
PDF 1.2 genes was abolished by mutations in EDS5 and COI1, respec- 
tively. Both the HR and resistance to CMV are conferred by a CC-NBS- 
LRR type R protein designated RCY1 (resistance to CMV strain Y1) 
(Takahashi et al., 2002). However, unlike the HRT-TCV system, RCY1- 
mediated resistance is only partially dependent on SA and EDS5 
(Fig. 1). Furthermore, the coil mutation, which results in JA insensi- 
tivity, abolishes RCY1-induced expression of PDF 1.2 but restores CMV 
resistance in RCY1 edséd plants (Takahashi et al., 2004). This result 
suggests that PDF1.2 expression does not contribute to CMV resis- 
tance and that cross talk between SA and JA pathways modulates this 
resistance. Since PR-1 expression is abolished in RCY1 eds5 plants 
(Takahashi et al., 2004), it is likely that the partial dependence 
on EDS5 is because the eds5 mutation affects SA accumulation. Con- 
sistent with the involvement of SA in resistance, Mayers et al. 
(2005) found that exogenous application of SA delays the onset of 
systemic movement of CMV in susceptible ecotypes of Arabidopsis. 
RCY1-mediated resistance is partially dependent on ethylene but 
independent of NPR1 (Takahashi et al., 2002). 

Although the predicted proteins encoded by RCY1 and its paralogs 
HRT and RPP8 show ~90% similarity at the amino acid level, they 
confer resistance to very different pathogens. It has been hypothesized 
that intragenic recombination and positive selection are responsible 
for evolution of the HRT/RPP8/RCY1 genes (Cooley et al., 2000; 
McDowell et al., 1998). Even though RCY1, HRT, and RPP8 share a 
high level of similarity, they appear to regulate distinct resistance 
pathways that are unique to each pathosystem. Moreover, comparison 
of the TCV and CMV coat proteins (CP), which are the Avr factors for 
these viruses (Oh et al., 1995; Takahashi et al., 2001; Zhao et al., 2000), 
fails to reveal any similarity at the amino acid or DNA levels. Thus, 
HRT and RCY1 may have evolved to recognize completely different 
ligands; alternatively, the interaction between these R proteins 
and their Avr factors may involve other common accessory factors. 
Given that HRT and TCV CP interact at the genetic level, but there 
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is no biochemical evidence for their direct physical interaction, the 
latter possibility is likely to be the case (Cooley et al., 2000; Ren 
et al., 2000; Wu and Klessig, unpublished data). Moreover, HRT- 
mediated resistance requires a physical interaction between the TCV 
CP and a protein belonging to the NAC family of transcription activa- 
tors, designated TIP (TCV interacting protein) (Ren et al., 2000). These 
and other studies with various R and Avr proteins suggest that inter- 
actions between most R proteins and their corresponding Avr factors 
do not occur directly, but instead require other accessory proteins/ 
factors (Luderer and Joosten, 2001). 

While expression of an HRT transgene in susceptible Col-0 plants 
confers HR formation after TCV infection, expression of an RCY1 
transgene in a susceptible ecotype did not confer a CMV-induced HR; 
however, resistance was observed in 50% of these plants (Takahashi 
et al., 2002). This observation suggests that HR development and CMV 
resistance either are influenced by the level of RCY/ transgene expres- 
sion, or that another locus (loci) is required for HR formation and/or 
resistance. Supporting the latter possibility, F2 progeny from a cross 
between C24 and the susceptible Landsberg ecotype showed digenic 
segregation for HR formation, which suggests that a recessive gene 
influences HR development (Takahashi et al., 1994). 


D. RTM1-Mediated Resistance to Tobacco etch virus in Arabidopsis 


Resistance to Tobacco etch virus (TEV) in Arabidopsis is governed 
by the dominant gene, RTM1, which functions by blocking long 
distance movement of the virus. RTM1-mediated resistance to TEV 
differs from that of a typical R gene because it does not involve HR 
formation or induction of defense genes associated with systemic ac- 
quired resistance (SAR). RTM1-mediated resistance also is unaffected 
by expression of a nahG transgene and is independent of NPR1, EDS1, 
PAD4, and NDR1 (Mahajan et al., 1998). These observations suggest 
that RTM1-mediated signaling either acts downstream of the various 
R gene-associated functions or is independent of them. In addition 
to RTM1, another dominant locus, RTM2 is involved in restricting 
TEV long distance movement. Both RTM1 and RTM2 are specific 
to TEV, since they do not appear to function as general defense controls 
against long distance viral movement. The RTM1 gene encodes a 
lectin-like protein, which has led to the suggestion that it restricts 
TEV long distance movement either by physically blocking viral 
entry into the vascular tissue or by inhibiting a factor required for 
long distance movement. RTM2 encodes a multidomain protein con- 
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taining an N-terminal region with high similarity to that of plant small 
HSPs (Whitham et al., 2000). Although RTM2’s small HSP-like domain 
is evolutionarily distinct from plant small HSPs, the discovery that 
HSPs can act as molecular chaperones in R gene-triggered signaling 
pathways (Hubert et al., 2003; Liu et al., 2004a; Takahashi et al., 2003) 
has prompted speculation that RTM-mediated signaling is somehow 
linked with active resistance conferred by typical R genes. 


E. Rx- and Rx2-Mediated Resistance to Potato virus X (PVX) in Potato 


Resistance to PVX is conferred by either of two genes, designated Rx 
and Rx2, which encode CC-NBS-LRR type proteins. Both R proteins 
display the same specificity for the PVX CP, which acts as the Avr 
factor. The resistance response to PVX, commonly referred to as ex- 
treme resistance, involves rapid arrest of PVX replication in the initi- 
ally infected cell; this prevents viral accumulation and minimizes the 
HR to a single cell (Goulden et al., 1993; Kohm et al., 1993). However, 
an HR to PVX is observed when Rx is transiently coexpressed with the 
PVX CP in N. benthamiana (Moffett et al., 2002). Resistance signaling 
mediated by Rx is dependent on SGT1, HSP90, and a MAPKKK (Lu 
et al., 2003; Peart et al., 2002a) but is independent of EDS1 and RAR1 
(Bieri et al., 2004; Peart et al., 2002b). 


F. Nb-Mediated Resistance to PVX in Potato 


Nb is a single dominant R locus that maps to chromosome V in potato 
and confers resistance to PVX isolates from strain groups 1 and 2. 
Unlike Rx-mediated signaling, Nb gene-dependent resistance is char- 
acterized by development of an HR, which is elicited by a 25-kDa 
protein from the pathogen (Marano et al., 2002). The Rx gene is epi- 
static to Nb (Marano et al., 2002), since it masks the Nb-associated 
resistance phenotype (Cockerham, 1970). 


G. Ry- and Ny,,,-Mediated Resistance to Potato virus Y in Potato 


Resistance to Potato virus Y (PVY) in potato is conferred by two 
different R genes, Ry and Ny,z,. Ry is a single dominant gene conferring 
extreme resistance to PVY, while Ny,,, confers HR-associated resis- 
tance. Ry and Ny,,, map on chromosomes XI and IV of potato, respec- 
tively (Hamalainen et al., 1997; Leister et al., 1996). Plants carrying 
Ny: exhibit both local and systemic HR to PVY. This systemic HR has 
been attributed to a low affinity interaction between the elicitor (Avr 
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factor) and the receptor (R protein) and/or when elicitor is produced at a 
late stage in the infection cycle (Celebi-Toprak et al., 2002). The Ry gene 
has been mapped in two different species of potato, and the identified 
genes were designated Ryaag and Rysto and mapped to chromosomes XI 
and XII, respectively (Brigneti et al., 1997; Song et al., 2005). A coding 
region designated Y-1, which was found to co-segregate with Ryaag, has 
been cloned and is predicted to encode a TIR-NBS-LRR class of resis- 
tance protein. Expression of the Y-1 transgene in a susceptible cultivar 
results in development of an HR to PVY, but the transgenic plants failed 
to prevent systemic viral spread (Vidal et al., 2002). The Avr factor for 
PVY appears to be the protease domain of the nuclear inclusion protein, 
NlaPro, since this protein is able to elicit Ry-mediated resistance 
responses. However, evidence suggests that a host factor(s) also plays 
a role in eliciting resistance responses (Mestre et al., 2003). 


H. Sw-5-Mediated Resistance to Tomato spotted wilt virus 
in Tomato 


Resistance to Tomato spotted wilt virus (TSWV) is conferred by Sw-5, 
which encodes a CC-NBS-LRR class of resistance protein. Resistance to 
TSWV is associated with development of an HR, which helps prevent 
the systemic spread of the virus (Jansen and Lenoire, 1998). Sw-5 
shares a high level of similarity with another tomato R gene, Mi, which 
confers resistance to nematodes (Brommonschenkel et al., 2000). 

Sw-5 gene confers broad spectrum resistance to TSWV isolates col- 
lected from different geographical regions as well as to two other 
tospoviruses, Tomato chlorotic spot virus and Groundnut ring spot 
virus, which infect tomato (Boiteux and Giordano, 1993; Stevens 
et al., 1992). In addition to Sw-5, tomato contains five different genes 
(two dominant, three recessive; Sw-al, Sw-b1, Sw-2, Sw-3, and Sw-4) 
that confer resistance to various isolates of TSWV. All five genes confer 
isolate-specific resistance, which can be easily overcome by different 
TSWV isolates and other tospoviruses (Boiteux and Giordano, 1993; 
Stevens et al., 1992). 


I. Tm-2?-Mediated Resistance to Tomato mosaic virus in Tomato 


Resistance to the tobamovirus, Tomato mosaic virus (ToMV), is con- 
ferred by three dominant R genes (Tm-1, Tm-2, and Tm-2”) that are 
present in different tomato accessions and are widely used in tomato 
breeding to protect against ToMV. Tm-1 was introgressed into the 
cultivated species (Lycopersicum esculentum) from its distant relative, 
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L. hirsutum, while Tm-2 and Tm-2? were introgressed from L. peruvia- 
num (Hall, 1980; Pelham, 1966). The Tm-2 and Tm-2” genes encode CC- 
NBS-LRR-type R proteins and are considered to be allelic to each other 
(Lanfermeijer et al., 2005; Tanksley et al., 1992). Tm-2?-conferred resis- 
tance is more durable than that conferred by Tm-2 (Fraser et al., 1989); 
this difference in durability has been attributed to four amino acid differ- 
ences between these proteins (Lanfermeijer et al., 2005). Both Tm-2 and 
Tm-2” recognize the movement protein of ToMV as the Avr factor (Calder 
and Palukaitis, 1992; Meshi et al., 1989; Weber and Pfitzner, 1998). 


J. Rsv1-Mediated Resistance to Soybean mosaic virus in Soybean 


Soybean Rsv1 is a single-locus, multi-allelic gene conferring resis- 
tance to Soybean mosaic virus (SMV). Different Rsv1 alleles condition 
diverse responses to the seven strains of SMV (G1-G7), ranging from 
extreme resistance and necrosis to mosaic symptoms. These genes 
encode a CC-NBS-LRR class of R protein. The 3gG2 gene within the 
Rsv1 locus is a strong candidate for the major Rsv1 gene mediating 
resistance to SMV (Hayes et al., 2004). 


K. L Locus-Mediated Resistance to Tobamoviruses in Pepper 


Resistance to tobamoviruses is governed by four dominant genes, L7— 
L* in Capsicum annum. These genes are allelic and can be differen- 
tiated based on the induction of an HR by specific strains of tobamo- 
viruses (Boukema, 1982; Boukema et al., 1980). L? gene-mediated 
resistance is most effective against all but the Pepper mild mottle 
tobamovirus strain (Garcia-Luque et al., 1993). Unlike most R gene- 
mediated responses, L?-mediated resistance shows a strong dosage- 
dependent response. When L? is homozygous, it confers extreme 
resistance. By contrast, plants heterozygous for L? do not exhibit 
an HR and are mildly susceptible (Boukema et al., 1980). 


III. OTHER SIGNALING MECHANISMS CONTRIBUTING TO VIRAL RESISTANCE 


A. Pathogen-Derived Resistance: Recruiting Part of the Enemy 
for Defense 


Resistance to viral pathogens also can be achieved by expressing a 
portion of viral genetic material in the plant. This form of resistance, 
popularly known as “pathogen-derived resistance,” is a commonly used 
approach to engineer viral resistance. For example, expression of the 
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TMV CP gene in tobacco blocks the disassembly of infecting virion 
thereby preventing replication and movement of TMV (Powell-Abel 
et al., 1986; Register and Beachy, 1988). This CP-mediated resistance 
(CPMR) is effective against virions but ineffective against unencapsi- 
dated virus or high inoculation titer. CPMR has been shown to work 
against a number of RNA viruses including PVX, Alfalfa mosaic virus 
(AIMV), CMV, TEV, Potato mop-top virus, Papaya ringspot virus, and 
Tobacco ringspot virus (Chapman et al., 1992; reviewed in Barker et al., 
1998; Baulcombe, 1996; Beachy, 1994; Ferreira et al., 2002; Wilson, 1993; 
Zadeh and Foster, 2004). Pathogen-derived resistance also can be 
achieved by expression of defective movement proteins, defective repli- 
case subunits, or symptom-suppressing viral satellite RNAs (sat-RNAs) 
(Anderson et al., 1992; Audy et al., 1994; Baulcombe, 1994; Brederode 
et al., 1995; Carr et al., 1994; Lapidot et al., 1993; Lomonossoff, 1995; 
Palukaitis and Zaitlin, 1997). Movement protein-mediated resistance 
has an advantage over CPMR since it provides resistance to several 
viruses. For example, expression of a defective movement protein of 
TMV confers resistance to tobamo-, cucumo-, potex-, and tobra-viral 
groups (Cooper et al., 1995). A peptide-mediated resistance strategy has 
been developed that involves expressing a 29-amino acid peptide in 
plants (Rudolph et al., 2003). This peptide, which was derived from the 
TSWV nucleocapsid protein, interacts with nucleoproteins from several 
members of the tospovirus group. Transgenic plants expressing this 
peptide exhibit enhanced resistance to TSWV, Tomato chlorotic spot 
virus, Groundnut ring spot virus, and Chrysanthemum stem necrosis 
virus (Rudolph et al., 2003). 


B. Viral Gene Silencing: A Tug-of-War Between Host and Pathogen 


Another way to achieve resistance to viral pathogens containing 
an RNA genome is to selectively target the viral RNA for degradation. 
When an RNA virus infects a host cell, it can activate a host defense 
mechanism known as RNA silencing (Ratcliff et al., 1999). RNA 
silencing is an epigenetic phenomenon manifested as posttranscriptional 
gene silencing (PTGS) in plants; it provides a highly specific, efficient 
surveillance system against parasitic RNA. Mechanistically, RNA 
silencing is triggered by formation of double-stranded RNA (dsRNA) 
(Fire et al., 1998; Klahre et al., 2002; Metzlaff et al., 1997; Waterhouse 
et al., 1998), which is subsequently cleaved by a dsRNA-specific RNase 
(commonly known as dicer) to produce small guide molecules 21-24 
nucleotides long, termed short interfering RNAs (siRNAs) (Hamilton 
and Balcombe, 1999; Martinez et al., 2002). The antisense strand of the 
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siRNA associates with an RNAi silencing complex to target homologous 
RNA for degradation (reviewed in Bartel, 2004; Baulcombe, 1999; Chicas 
and Macino, 2001; Matzke et al., 2001; Vance and Vaucheret, 2001; 
Waterhouse et al., 2001). The RNA silencing signal is also transported 
to other cells in the form of siRNAs, through plasmodesmata and phloem, 
to trigger silencing throughout the plant (Mlotshwa et al., 2002; Yooet al., 
2004). The mechanism of RNA silencing is highly conserved in plants and 
animals and is implicated in viral resistance (Covey et al., 1997; Ratcliff 
et al., 1997), genome maintenance (Assaad et al., 1993), and developmen- 
tal control (Boerjan et al., 1994). 

Silencing and degradation of viral RNA can be engineered by expres- 
sing a portion of viral RNA in the host genome. For example, expression 
of anontranslatable version of the TEV CP in tobacco confers resistance 
to subsequent TEV infection. Resistance to TEV, due to silencing of the 
CP, is specific to TEV and does not provide resistance to other viruses 
such as CMV, AIMV, or PVY (Lindbo and Dougherty, 1992; Lindbo et al., 
1993). Another method for engineering viral resistance exploits the 
effect of sat-RNAs, which are subgenomic viral RNAs that depend on 
the viral genomic RNA (helper virus) for replication, encapsidation, and 
movement. Some sat-RNAs are known to either attenuate or exacerbate 
disease symptoms caused by their helper viruses (Kong et al., 1997; 
Roossinck et al., 1992). Thus, expression of sat-RNAs in transgenic 
tobacco has been used to confer resistance against Groundnut rosette 
virus (Taliansky et al., 1998). Similarly, resistance to sat-RNA-free 
strains of CMV was obtained when a benign variant of a CMV 
sat-RNA was expressed in tomato (Cillo et al., 2004). 

In addition to using portions of the viral genome or sat-RNAs, resis- 
tance has been engineered using constructs encoding an intron-spliced 
RNA with a hairpin loop structure. The PTGS mediated by hairpin 
RNA (hpRNA) was stable and more efficient than PTGS mediated by 
expressing sense or antisense constructs (Smith et al., 2000). Transient 
expression of hpRNAs containing sequences derived from viral patho- 
gens has been shown to effectively confer resistance. For example, 
tobacco infiltrated with Agrobacterium carrying hpRNA derived from 
Pepper mild mottle virus were resistant to subsequent infection by 
this virus (Tenllado et al., 2004). In a novel study, expression of self- 
complementary hpRNA in tobacco was placed under control of the rolC 
promoter of Agrobacterium rhizogenes. The rolC promoter is tissue 
specific as it is expressed only in vascular tissues (Schmiilling et al., 
1989; Sugaya and Uchimiya, 1992); the resultant plants displayed 
systemic resistance to Plum pox virus (Pandolfini et al., 2003). 
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Plant viruses have evolved counter defenses to overcome RNA 
silencing in their host. One strategy is to encode proteins that suppress 
host-mediated RNA silencing thereby facilitating replication and sys- 
temic movement of the virus (Anandalakshmi et al., 1998; Brigneti 
et al., 1998; Kasschau and Carrington, 1998; Mallory et al., 2001; 
Soosaar et al., 2005; Voinnet, 2001, 2005). An interesting feature of 
these viral suppressors of host-mediated RNA silencing is that they do 
not share obvious sequence or structural similarity to one another. 
Furthermore, suppressor activity has been identified in a variety of 
viral proteins that serve different functions, including structural pro- 
teins (such as CP) and nonstructural proteins (such as replicases). This 
suggests that several different mechanisms are involved in suppres- 
sing RNA silencing. Suppressors of RNA silencing appear to function 
in both animal and plant cells, regardless of the host origin of the virus 
(Dunoyer et al., 2004; Lakatos et al., 2004). Thus, the silencing 
machinery and/or host signaling leading to RNA silencing appear to 
be conserved between plants and animals. 


IV. CONCLUSIONS 


The past decade has witnessed a substantial growth in our under- 
standing of R gene signaling and RNA silencing pathways. While it is 
probable that these pathways communicate with each during the devel- 
opment of an effective resistance response to viral attack, the evidence 
to support such a link is rather scarce. A possible connection between 
resistance mediated by R genes and RNA silencing is indicated by the 
observation that in certain host—virus interactions (e.g., Di-17 and 
TCV) the same viral protein can function as both an Avr factor and a 
suppressor of RNA silencing. In addition, the ability of SA to induce host 
RdRps, which are components of the RNA silencing pathway, further 
suggests an overlap between these pathways. Improved understanding 
of R gene-mediated resistance and the RNA silencing pathways should 
facilitate better control of viral diseases by making it possible to further 
manipulate and enhance resistance in plants. 
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ABSTRACT 


Coronaviruses are large, enveloped RNA viruses of both medical and 
veterinary importance. Interest in this viral family has intensified in 
the past few years as a result of the identification of a newly emerged 
coronavirus as the causative agent of severe acute respiratory syn- 
drome (SARS). At the molecular level, coronaviruses employ a variety 
of unusual strategies to accomplish a complex program of gene expres- 
sion. Coronavirus replication entails ribosome frameshifting during 
genome translation, the synthesis of both genomic and multiple sub- 
genomic RNA species, and the assembly of progeny virions by a path- 
way that is unique among enveloped RNA viruses. Progress in the 
investigation of these processes has been enhanced by the development 
of reverse genetic systems, an advance that was heretofore obstructed 
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by the enormous size of the coronavirus genome. This review sum- 
marizes both classical and contemporary discoveries in the study of the 
molecular biology of these infectious agents, with particular emphasis 
on the nature and recognition of viral receptors, viral RNA synthesis, 
and the molecular interactions governing virion assembly. 


I. INTRODUCTION 


Coronaviruses are a family of enveloped RNA viruses that are 
distributed widely among mammals and birds, causing principally 
respiratory or enteric diseases but in some cases neurologic illness or 
hepatitis (Lai and Holmes, 2001). Individual coronaviruses usually 
infect their hosts in a species-specific manner, and infections can be 
acute or persistent. Infections are transmitted mainly via respiratory 
and fecal-oral routes. The most distinctive feature of this viral family is 
genome size: coronaviruses have the largest genomes among all RNA 
viruses, including those RNA viruses with segmented genomes. This 
expansive coding capacity seems to both provide and necessitate a 
wealth of gene-expression strategies, most of which are incompletely 
understood. 

Two prior reviews with the same title as this one have appeared 
in the Advances in Virus Research series (Lai and Cavanagh, 1997; 
Sturman and Holmes, 1983). The earlier of the two noted that the 
recognition of coronaviruses as a separate virus family occurred in 
the 1960s, in the wake of the discovery of several new human respira- 
tory pathogens, certain of which, it was realized, appeared highly 
similar to the previously described avian infectious bronchitis virus 
(IBV) and mouse hepatitis virus (MHV) (Almeida and Tyrrell, 1967). 
These latter viruses had a characteristic morphology in negative-stained 
electron microscopy, marked by a “fringe” of surface structures described 
as “spikes” (Berry et al., 1964) or “club-like” projections (Becker et al., 
1967). Such structures were less densely distributed and differently 
shaped than those of the myxoviruses. To some, the fringe resembled 
the solar corona, giving rise to the name that was ultimately assigned to 
the group (Almeida et al., 1968). Almost four decades later, recognition of 
the same characteristic virion morphology alerted the world to the emer- 
gence of another new human respiratory pathogen: the coronavirus 
responsible for the devastating outbreak of severe acute respiratory 
syndrome (SARS) in 2002-2003 (Ksiazek et al., 2003; Peiris et al., 
2003). The sudden appearance of SARS has stimulated a burst of new 
research to understand the basic replication mechanisms of members of 
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this family of viral agents, as a means toward their control and prophy- 
laxis. Thus, the time is right to again assess the state of our collective 
knowledge about the molecular biology of coronaviruses. 

Owing to limitations imposed by both space and the expertise of the 
author, “molecular biology” will be considered here in the more narrow 
sense, that is, the molecular details of the cellular replication of coro- 
naviruses. No attempt will be made to address matters of pathogenesis, 
viral immunology, or epidemiology. For greater depth and differences of 
emphasis in particular areas, as well as for historical perspectives, the 
reader is referred to the two excellent predecessors of this review (Lai 
and Cavanagh, 1997; Sturman and Holmes, 1983) and also to volumes 
edited by Siddell (1995) and Enjuanes (2005). 


II. TAXONOMY 


Coronaviruses are currently classified as one of the two genera in 
the family Coronaviridae (Enjuanes et al., 2000b). However, it is likely 
that the coronaviruses, as well as the other genus within the 
Coronaviridae, the toroviruses (Snijder and Horzinek, 1993), will each be 
accorded the taxonomic status of family in the near future (Gonzalez et al., 
2003). Therefore, throughout this review, the coronaviruses are referred 
to as a family. Both the coronaviruses and the toroviruses, in addition to 
two other families, the Arteriviridae (Snijder and Meulenberg, 1998) 
and the Roniviridae (Cowley et al., 2000; Dhar et al., 2004), have been 
grouped together in the order Nidovirales. This higher level of organiza- 
tion recognizes a relatedness among these families that sets them apart 
from other nonsegmented positive-strand RNA viruses. The most salient 
features that all nidoviruses have in common are: gene expression through 
transcription of a set of multiple 3’-nested subgenomic RNAs; expression of 
the replicase polyprotein via ribosomal frameshifting; unique enzymatic 
activities among the replicase protein products; a virion membrane enve- 
lope; and a multispanning integral membrane protein in the virion. The 
first of these qualities provides the name for the order, which derives from 
the Latin nido for nest (Enjuanes et al., 2000a). In contrast to their 
commonalities, however, nidovirus families differ from one another in 
distinct ways, most conspicuously in the numbers, types, and sizes of the 
structural proteins in their virions and in the morphologies of their nucleo- 
capsids. A more detailed comparison of characteristics of these 
virus families has been given by Enjuanes et al. (2000b) and Lai and 
Cavanagh (1997). 


196 PAUL S. MASTERS 


Members of the coronavirus family have been sorted into three 
groups (Table I), which, it has been proposed, are sufficiently divergent 
to merit the taxonomic status of genera (Gonzalez et al., 2003). Classi- 
fication into groups was originally based on antigenic relationships. 
However, such a criterion reflects the properties of a limited subset of 
viral proteins, and cases have arisen where clearly related viruses in 
group 1 were found not to be serologically cross-reactive (Sanchez 
et al., 1990). Consequently, sequence comparisons of entire viral gen- 
omes (or of as much genomic sequence as is available) have come to be 
the basis for group classification (Gorbalenya et al., 2004). Almost all 
group 1 and group 2 viruses have mammalian hosts, with human 
coronaviruses falling into each of these groups. Viruses of group 3, by 
contrast, have been isolated solely from avian hosts. Most of the cor- 
onaviruses in Table I have been studied for decades, and, by the turn of 
the century, the scope of the family seemed to be fairly well-defined. 
Accordingly, it came as quite a shock, in 2003, when the causative 
agent of SARS was found to be a coronavirus (SARS-CoV). Equally 
astonishing have been the outcomes of renewed efforts, following the 
SARS epidemic, to detect previously unknown viruses; these investi- 
gations have led to the discovery of two more human respiratory 
coronaviruses, HCoV-NL63 (van der Hoek et al., 2004) and HCoV- 
HKU 1 (Woo et al., 2005). Three distinct bat coronaviruses have also 
been isolated: two are members of group 1, and the third, in group 2, is 
a likely precursor of the human SARS-CoV (Lau et al., 2005; Li et al., 
2005c; Poon et al., 2005). In addition, new IBV-like viruses have been 
found that infect geese, pigeons, and ducks (Jonassen et al., 2005). 

In almost all cases, the assignment of a coronavirus species to a given 
group has been unequivocal. Exceptionally, the classification of SARS- 
CoV has provoked considerable controversy. The original, unrooted, 
phylogenetic characterizations of the SARS-CoV genome sequence pos- 
ited this virus to be roughly equidistant from each of the three previ- 
ously established groups. It was thus proposed to be the first recognized 
member of a fourth group of coronaviruses (Marra et al., 2003; Rota 
et al., 2003). However, a subsequently constructed phylogeny based on 
gene 1b, which contains the viral RNA-dependent RNA polymerase and 
which was rooted in the toroviruses as an outgroup, concluded that 
SARS-CoV is most closely related to the group 2 coronaviruses (Snijder 
et al., 2003). In the same vein, it was noted that regions of gene Ja of 
SARS-CoV contain domains that are unique to the group 2 corona- 
viruses (Gorbalenya et al., 2004). Other analyses of a subset of structur- 
al gene sequences (Hickmann et al., 2003) and of RNA secondary 
structures in the 3’ untranslated region (3’ UTR) of the genome (Goebel 
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TABLE I 
CORONAVIRUS SPECIES AND GROUPS 
GenBank 
accession 
Group Designation Species Host number* 
1 TGEV Transmissible Pig AJ271965 [g] 
gastroenteritis virus 
PRCoV Porcine respiratory Pig Z24675 [p] 
coronavirus 
FIPV Feline infectious Cat AY994055 [g] 
peritonitis virus 
FCoV Feline enteric Cat Y13921 [p] 
coronavirus 
CCoV Canine coronavirus Dog D13096 [p] 
HCoV-229E Human coronavirus Human AF304460 [g] 
strain 229E 
PEDV Porcine epidemic Pig AF353511 [g] 
diarrhea virus 
HCoV-NL63 Human coronavirus Human AY567487 [g] 
strain NL63 
Bat-CoV-61 Bat coronavirus Bat AY864196 [p] 
strain 61 
Bat-CoV-HKU2 Bat coronavirus strain Bat AY594268 [p] 
HKU2 
2 MHV Mouse hepatitis virus Mouse AY700211 [g] 
BCoV Bovine coronavirus Cow U00735 [g] 
RCoV Rat coronavirus Rat AF088984 [p] 
SDAV Sialodacryoadenitis Rat AF207551 [p] 
virus 
HCoV-OC43 Human coronavirus Human AY903460 [g] 
strain OC43 
HEV Hemagglutinating Pig AF481863 [p] 
encephalomyelitis 
virus 
PCovt Puffinosis coronavirus Puffin AJ544718 [p] 
ECoV Equine coronavirus Horse AY316300 [p] 
CRCoV Canine respiratory Dog CQ772298 [p] 
coronavirus 
SARS-CoV SARS coronavirus Human AY278741 [g] 
HCoV-HKU1 Human coronavirus Human AY597011 [g] 
strain HKU1 
Bat-SARS-CoV Bat SARS coronavirus Bat DQ022305 [g] 





(continues) 
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TABLE I (continued) 





GenBank 
accession 
Group Designation Species Host number* 
3 IBV Infectious bronchitis Chicken AJ311317 [g] 
virus 
TCoV Turkey coronavirus Turkey AY342357 [p] 
PhCoV Pheasant coronavirus Pheasant AJ618988 [p] 
GCoV Goose coronavirus Goose AJ871017 [p] 
PCov' Pigeon coronavirus Pigeon AJ871022 [p] 
DCoV Duck coronavirus Mallard AJ871024 [p] 





“ One representative GenBank accession number is given for each species. When 
available, a complete genomic sequence (denoted [g]) is provided; otherwise, the largest 
available partial sequence (denoted [p]) is given. 

t Unique designations have not yet been formulated for these two viruses. 


et al., 2004b) also supported a group 2 assignment. By contrast, some 
authors have argued, based on bioinformatics methods, that the ancestor 
of SARS-CoV was derived from multiple recombination events among 
progenitors from all three groups (Rest and Mindell, 2003; Stanhope 
et al., 2004; Stavrinides and Guttman, 2004). While these latter studies 
assume that historically there has been limitless opportunity for inter- 
group recombination, there is no well-documented example of recombi- 
nation between extant coronaviruses of different groups. Moreover, it is 
not clear that intergroup recombination is even possible, owing to repli- 
cative incompatibilities among the three coronavirus groups (Goebel 
et al., 2004b). Therefore, although SARS-CoV does indeed have unique 
features, the currently available evidence best supports the conclusion 
that it is more closely allied with the group 2 coronaviruses and that it 
has not sufficiently diverged to constitute a fourth group (Gorbalenya 
et al., 2004). 


III. Virion MORPHOLOGY, STRUCTURAL PROTEINS, AND ACCESSORY PROTEINS 


A. Virus and Nucleocapsid 


Coronaviruses are roughly spherical and moderately pleiomorphic 
(Fig. 1). Virions have typically been reported to have average dia- 
meters of 80-120 nm, but extreme sizes as small as 50 nm and as large 
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Fic 1. Schematic of the coronavirus virion, with the minimal set of structural 
proteins. 


as 200 nm are occasionally given in the older literature (Oshiro, 1973; 
McIntosh, 1974). The surface spikes or peplomers of these viruses, vari- 
ously described as club-like, pear-shaped, or petal-shaped, project some 
17-20 nm from the virion surface (McIntosh, 1974), having a thin base 
that swells to a width of about 10 nm at the distal extremity (Sugiyama 
and Amano, 1981). For some coronaviruses a second set of projections, 
5-10-nm long, forms an undergrowth beneath the major spikes (Guy 
et al., 2000; Patelet al., 1982; Sugiyama and Amano, 1981). These shorter 
structures are now known to be the hemagglutinin-esterase (HE) protein 
that is found in a subset of group 2 coronaviruses (Section III.G). 

At least some of the heterogeneity in coronavirus particle morphology 
can be attributed to the distorting effects of negative-staining proce- 
dures. Freeze-dried (Roseto et al., 1982) and cryo-electron microscopic 
(Risco et al., 1996) preparations of BCoV and TGEYV, respectively, 
showed much more homogeneous populations of virions, with diameters 
10-30 nm greater than virions in comparable samples prepared by 
negative staining. Extraordinary three-dimensional images have been 
obtained for SARS-CoV virions emerging from infected Vero cells (Ng 
et al., 2004). These scanning electron micrographs and atomic force 
micrographs reveal knobby, rosette-like viral particles resembling tiny 
cauliflowers. It will be exciting to see future applications of advanced 
imaging techniques to the study of coronavirus structure. 

The internal component of the coronavirus virion is obscure in elec- 
tron micrographs of whole virions. In negative-stained images the 
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core appears as an indistinct mass with a densely staining center, 
giving the virion a “punched-in” spherical appearance. Imaging of 
virions that have burst spontaneously, expelling their contents, or 
that have been treated with nonionic detergents has allowed visuali- 
zation of the coronavirus core. Such analyses led to the attribution 
of another distinguishing characteristic to the coronavirus family: that 
its members possess helically symmetric nucleocapsids. Such nucleo- 
capsid symmetry is the rule for negative-strand RNA viruses, but 
almost all positive-strand RNA animal viruses have icosahedral ribo- 
nucleoprotein capsids. However, although it is fairly well accepted that 
coronaviruses have helical nucleocapsids, there are surprisingly few 
published data that bear on this issue. Additionally, the reported results 
vary considerably with both the viral species and the method of prepara- 
tion. The earliest study of nucleocapsids from spontaneously disrupted 
HCoV-229E virions found tangled, threadlike structures 8-9 nm in diam- 
eter; these were unraveled or clustered to various degrees and, in rare 
cases, retained some of the shape of the parent virion (Kennedy and 
Johnson-Lussenburg, 1975/76). A subsequent analysis of spontaneously 
disrupted virions of HCoV-229E and MHV observed more clearly helical 
nucleocapsids, with diameters of 14-16 nm and hollow cores of 3-4 nm 
(Macnaughton et al., 1978). The most highly resolved images of any 
coronavirus nucleocapsid were obtained with NP-40-disrupted HCoV- 
229E virions (Caul et al., 1979). These preparations showed filamentous 
structures 9-11 or 11-13 nm in diameter, depending on the method of 
staining, with a 3—4-nm central canal. The coronavirus nucleocapsid was 
noted to be thinner in cross-section than those of paramyxoviruses and 
also to lack the sharply segmented “herringbone” appearance character- 
istic of paramyxovirus nucleocapsids. By contrast, in early studies, IBV 
and TGEV nucleocapsids were refractory to the techniques that had been 
successful with other viruses. Visualization of IBV nucleocapsids, which 
seemed to be very sensitive to degradation (Macnaughton et al., 1978), 
was finally achieved by electron microscopy of viral samples prepared by 
carbon-platinum shadowing (Davies et al., 1981). This revealed linear 
strands, some as long as 6-7 um, which were only 1.5-nm thick, suggest- 
ing that they represented unwound helices. TGEV, on the other hand, 
was found to be more resistant to nonionic detergents. Treatment of 
virions of this species with NP-40 resulted in spherical subviral particles 
with no threadlike substructure visible (Garwes et al., 1976). The TGEV 
core was later seen as a spherically symmetric, possibly icosahedral, 
superstructure that only dissociated further into a helical nucleocapsid 
following Triton X-100 treatment of virions (Risco et al., 1996). Such a 
collection of incomplete and often discrepant results makes it clear that 
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much further examination of the internal structure of coronavirus vir- 
ions is warranted. It would substantially aid our understanding of coro- 
navirus structure and assembly if we had available a detailed description 
of nucleocapsid shape, length, diameter, helical repeat distance, and 
protein:RNA stoichiometry. 


B. Spike Protein (S) 


There are three protein components of the viral envelope (Fig. 1). 
The most prominent of these is the S glycoprotein (formerly called E2) 
(Cavanagh, 1995), which mediates receptor attachment and viral and 
host cell membrane fusion (Collins et al., 1982). The S protein is a very 
large, N-exo, C-endo transmembrane protein that assembles into tri- 
mers (Delmas and Laude, 1990; Song et al., 2004) to form the distinc- 
tive surface spikes of coronaviruses (Fig. 2). S protein is inserted into 
the endoplasmic reticulum (ER) via a cleaved, amino-terminal signal 
peptide (Cavanagh et al., 1986b). The ectodomain makes up most of the 
molecule, with only a small carboxy-terminal segment (of 71 or fewer of 
the total 1162-1452 residues) constituting the transmembrane domain 
and endodomain. Monomers of S protein, prior to glycosylation, 
are 128-160 kDa, but molecular masses of the glycosylated forms of 


S1 
Transmembrane 
Signal peptide domain 
S1 y S2 
N c 
m MH F HR1 HR2 
HR1 S2 SARS-CoV — 
HCoV-229E m Endodomain 
HR2 RBD TGEV mumm 
SO) | 





Fic 2. The spike (S) protein. At the right is a linear map of the protein, denoting the 
amino-terminal S1 and the carboxy-terminal S2 portions of the molecule. The arrow- 
head marks the site of cleavage for those S proteins that become cleaved by cellular 
protease(s). The signal peptide and regions of mapped receptor-binding domains (RBDs) 
are shown in S1. The heptad repeat regions (HR1 and HR2), putative fusion peptide (F), 
transmembrane domain, and endodomain are indicated in S2. At the left is a model for 
the S protein trimer. 
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full-length monomers fall in the range of 150—200 kDa. The S molecule 
is thus highly glycosylated, and this modification is exclusively 
N-linked (Holmes et al., 1981; Rottier et al., 1981). S protein ecto- 
domains have from 19 to 39 potential consensus glycosylation sites, 
but a comprehensive mapping of actual glycosylation has not yet been 
reported for any coronavirus. A mass spectrometric analysis of the 
SARS-CoV S protein has shown that at least 12 of the 23 candidate 
sites are glycosylated in this molecule (Krokhin et al., 2003). For the 
TGEV S protein, it has been demonstrated that the early steps of 
glycosylation occur cotranslationally, but that terminal glycosylation 
is preceded by trimerization, which can be rate-limiting in S protein 
maturation (Delmas and Laude, 1990). In addition, glycosylation of 
TGEV S may assist monomer folding, given that tunicamycin inhibi- 
tion of high-mannose transfer was found to also block trimerization. 

The S protein ectodomain has between 30 and 50 cysteine residues, 
and within each coronavirus group the positions of cysteines are well 
conserved (Abraham et al., 1990; Eickmann et al., 2003). However, as 
with glycosylation, a comprehensive mapping of disulfide linkages has 
not yet been achieved for any coronavirus S protein. 

In most group 2 and all group 3 coronaviruses, the S protein is 
cleaved by a trypsin-like host protease into two polypeptides, S1 and 
S2, of roughly equal sizes. Even for uncleaved S proteins, that is, those 
of the group 1 coronaviruses and SARS-CoV, the designations S1 and 
S2 are used for the amino-terminal and carboxy-terminal halves of the 
S protein, respectively. Peptide sequencing has shown that cleavage 
occurs following the last residue in a highly basic motif: RRFRR in 
IBV S protein (Cavanagh et al., 1986b), RRAHR in MHV strain A59 
S protein (Luytjes et al., 1987), and KRRSRR in BCoV S protein 
(Abraham et al., 1990). Similar cleavage sites are predicted from the 
sequences of other group 2 S proteins, except that of SARS-CoV. It has 
been noted that the S protein of MHV strain JHM has a cleavage motif 
(RRARR) more basic than that found in MHV strain A59 (RRAHR). An 
expression study has shown that this difference accounts for the al- 
most total extent of cleavage of the JHM S protein that is seen in cell 
lines in which the A59 S protein undergoes only partial cleavage (Bos 
et al., 1995). 

The S1 domain is the most divergent region of the molecule, both 
across and within the three coronavirus groups. Even among strains 
and isolates of a single coronavirus species, the sequence of S1 can 
vary extensively (Gallagher et al., 1990; Parker et al., 1989; Wang 
et al., 1994). By contrast, the most conserved part of the molecule 
across the three coronavirus groups is a region that encompasses the 
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S2 portion of the ectodomain, plus the start of the transmembrane 
domain (de Groot et al., 1987). An early model for the coronavirus 
spike, which has held up well in light of subsequent work, proposed 
that the S1 domains of the S protein oligomer constitute the bulb 
portion of the spike. The stalk portion of the spike, on the other hand, 
was envisioned to be a coiled-coil structure, analogous to that in influ- 
enza HA protein, formed by association of heptad repeat regions of the 
S2 domains of monomers (de Groot et al., 1987). The roles of these two 
regions of the S protein in the initiation of infection will be discussed 
(Section IV.A). 


C. Membrane Protein (M) 


The M glycoprotein (formerly called E1) is the most abundant con- 
stituent of coronaviruses (Sturman, 1977; Sturman et al., 1980) and 
gives the virion envelope its shape. The preglycosylated M polypeptide 
ranges in size from 25 to 30 kDa (221-262 amino acids), but multiple 
higher-molecular-mass glycosylated forms are often observed by SDS- 
PAGE (Krijnse Locker et al., 1992a). The M protein of MHV has also 
been noted to multimerize under standard conditions of SDS-PAGE 
(Sturman, 1977). 

M is a multispanning membrane protein with a small, amino- 
terminal domain located on the exterior of the virion, or, intracellularly, 
in the lumen of the ER (Fig. 3). The ectodomain is followed by three 
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Fic 3. The membrane (M), envelope (E), and nucleocapsid (N) proteins. At the right 
are linear maps of the proteins, denoting known regions of importance, including trans- 
membrane (tm) domains. At the left are models for the three proteins. 
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transmembrane segments and then a large carboxy terminus compris- 
ing the major part of the molecule. This latter domain is situated in the 
interior of the virion or on the cytoplasmic face of intracellular mem- 
branes (Rottier, 1995). M proteins within each coronavirus group are 
moderately well conserved, but they are quite divergent across the 
three groups. The region of M protein showing the most conservation 
among all coronaviruses is a segment of some 25 residues encompassing 
the end of the third transmembrane domain and the start of the en- 
dodomain; a portion of this segment even retains homology to its tor- 
ovirus counterpart (den Boon et al., 1991). The ectodomain, which is the 
least conserved part of the M molecule, is glycosylated. For most group 
2 coronaviruses, glycosylation is O-linked, although two exceptions to 
this pattern are MHV strain 2 (Yamada et al., 2000) and SARS-CoV (Nal 
et al., 2005), both of which have M proteins with N-linked carbohydrate. 
Group 1 and group 3 coronavirus M proteins, by contrast, exhibit 
N-linked glycosylation exclusively (Cavanagh and Davis, 1988; Garwes 
etal., 1984; Jacobs et al., 1986; Stern and Sefton, 1982). At the time ofits 
discovery in the MHV M protein, O-linked glycosylation had not previ- 
ously been seen to occur in a viral protein (Holmes et al., 1981), and 
MHV M has since been used as a model to study the sites and mecha- 
nism of this type of posttranslational modification (de Haan et al., 
1998b; Krijnse Locker et al., 1992a; Niemann et al., 1982). Although 
the roles of M protein glycosylation are not fully understood, the glyco- 
sylation status of M can influence both organ tropism in vivo and the 
capacity of some coronaviruses to induce alpha interferon in vitro 
(Charley and Laude, 1988; de Haan et al., 2003a; Laude et al., 1992). 
The coronavirus M protein was the first polytopic viral membrane 
protein to be described (Armstrong et al., 1984; Rottier et al., 1984), 
and the atypical topology of the MHV and IBV M proteins was exam- 
ined in considerable depth in cell-free translation and cellular expres- 
sion studies. For both of these M proteins, the entire ectodomain was 
found to be protease sensitive. However, at the other end of the mole- 
cule, no more than 20-25 amino acids could be removed from the 
carboxy terminus by protease treatment (Cavanagh et al., 1986a; 
Mayer et al., 1988; Rottier et al., 1984, 1986). This pattern suggested 
that almost all of the endodomain of M is tightly associated with the 
surface of the membrane or that it has an unusually compact structure 
that is refractory to proteolysis (Rottier, 1995). Most M proteins do not 
possess a cleaved amino-terminal signal peptide (Cavanagh et al., 
1986b; Rottier et al., 1984), and for both IBV and MHV it was demon- 
strated that either the first or the third transmembrane domain alone 
is sufficient to function as the signal for insertion and anchoring of the 
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protein in its native orientation in the membrane (Krijnse Locker 
et al., 1992b; Machamer and Rose, 1987; Mayer ez al., 1988). The 
M proteins of a subset of group 1 coronaviruses (TGEV, FIPV, and 
CCoV) each contain a cleavable amino-terminal signal sequence 
(Laude et al., 1987), although this element may not be required for 
membrane insertion (Kapke et al., 1988; Vennema et al., 1991). Anoth- 
er anomalous feature of at least one group 1 coronavirus, TGEV, is that 
roughly one-third of its M protein assumes a topology in which part of 
the endodomain constitutes a fourth transmembrane segment, thereby 
positioning the carboxy terminus of the molecule on the exterior of the 
virion (Risco et al., 1995). This alternative configuration of M has yet to 
be demonstrated for other coronavirus family members. 


D. Envelope Protein (E) 


The E protein (formerly called sM) is a small polypeptide, ranging 
from 8.4 to 12 kDa (76-109 amino acids), that is only a minor constitu- 
ent of virions (Fig. 3). Owing to its tiny size and limited quantity, E was 
recognized as a virion component much later than were the other 
structural proteins, first in IBV (Liu and Inglis, 1991) and then in 
TGEV (Godet et al., 1992) and MHV (Yu et al., 1994). Its significance 
was also obscured by the fact that in some coronaviruses, the coding 
region for E protein occurs as the furthest-downstream open reading 
frame (ORF) in a bi- or tricistronic mRNA and must therefore be 
expressed by a nonstandard translational mechanism (Boursnell 
et al., 1985; Budzilowicz and Weiss, 1987; Leibowitz et al., 1988; Liu 
et al., 1991; Skinner et al., 1985; Thiel and Siddell, 1994). E protein 
sequences are extremely divergent across the three coronavirus groups 
and in some cases, among members of a single group. Nevertheless, 
the same general architecture can be discerned in all E proteins: a 
short hydrophilic amino terminus (8-12 residues), followed by a large 
hydrophobic region (21—29 residues) containing two to four cysteines, 
and a then hydrophilic carboxy-terminal tail (39-76 residues), the 
latter constituting most of the molecule. 

E is an integral membrane protein, as has been shown for both the 
MHV and IBV E proteins by the criterion of resistance to alkaline 
extraction (Corse and Machamer, 2000; Vennema et al., 1996), and 
membrane insertion occurs without cleavage of a signal sequence 
(Raamsman et al., 2000). The E protein of IBV has been shown to be 
palmitoylated on one or both of its two cysteine residues (Corse 
and Machamer, 2002), but it is not currently clear whether this 
modification is a general characteristic. One study of MHV E showed 
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a gel mobility shift of E caused by hydroxylamine treatment, which 
cleaves thioester linkages (Yu et al., 1994), but attempts to incorporate 
labeled palmitic acid into either the TGEV or MHV E protein have 
been unsuccessful (Godet et al., 1992; Raamsman et al., 2000). The 
topology of E in the membrane is at least partially resolved. Although 
one early report suggested a C-exo, N-endo membrane orientation for 
the TGEV E protein (Godet et al., 1992), more extensive investigations 
of the MHV and IBV E proteins both concluded that the carboxy- 
terminal tail of the molecule is cytoplasmic (or, correspondingly, is 
situated in the interior of the virion) (Corse and Machamer, 2000; 
Raamsman et al., 2000). Moreover, for IBV E, it was shown that the 
carboxy-terminal tail, in the absence of the membrane-bound domain, 
specifies targeting to the budding compartment (Corse and Machamer, 
2002). The status of the amino terminus is less clear, however. The IBV 
E protein amino terminus was inaccessible to antibodies at the cyto- 
plasmic face of the Golgi membrane, suggesting that this end of the 
molecule is situated in the lumen (corresponding to the exterior of 
the virion) (Corse and Machamer, 2000). Such a single transit, placing 
the termini of the protein on opposite faces of the membrane, would be 
consistent with prediction, by molecular dynamics simulations, that a 
broad set of E proteins occur as transmembrane oligomers (Torres 
et al., 2005). Conflicting results were obtained with MHV E, though. 
Based on the cytoplasmic reactivity of an engineered amino-terminal 
epitope tag, it was proposed that the MHV E protein amino terminus is 
buried within the membrane near the cytoplasmic face (Maeda et al., 
2001). This result also accords with the finding that no part of the 
MHV E protein in purified virions is accessible to protease treatment 
(Raamsman et al., 2000). Such an orientation would mean that the 
hydrophobic domain of E protein forms a hairpin, looping back through 
the membrane. This topology agrees with the outcome of a biophysical 
analysis of the SARS-CoV E protein transmembrane domain (Arbely 
et al., 2004). However, in the latter study it was asserted that the 
palindromic hairpin configuration of the transmembrane segment is 
unique to the SARS-CoV E protein, which begs the question of how the 
other coronavirus E proteins are situated in the membrane and why 
the E protein of SARS-CoV should differ. 


E. Nucleocapsid Protein (N) 


The N protein, which ranges from 43 to 50 kDa, is the protein com- 
ponent of the helical nucleocapsid and is thought to bind the genomic 
RNA in a beads-on-a-string fashion (Laude and Masters, 1995) (Fig. 3). 
Based on a comparison of sequences of multiple strains, it has been 
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proposed that the MHV N protein is divided into three conserved 
domains, which are separated by two highly variable spacer regions 
(Parker and Masters, 1990). Domains 1 and 2, which constitute most of 
the molecule, are rich in arginines and lysines, as is typical of many 
viral RNA-binding proteins. In contrast, the short, carboxy-terminal 
domain 3 has a net negative charge resulting from an excess of acidic 
over basic residues. While there is now considerable evidence to sup- 
port the notion that domain 3 truly constitutes a separate domain 
(Hurst et al., 2005; Koetzner et al., 1992), little is known about the 
structure of the other two putative domains. The overall features of the 
three-domain model appear to extend to N proteins of coronaviruses 
in groups 1 and 3, although the boundaries between domains appear 
to be less clearly defined for these latter N proteins. There is not a 
high degree of intergroup sequence homology among N proteins, with 
the exception of a strongly conserved stretch of 30 amino acids, 
near the junction of domains 1 and 2, which contains many aromatic 
hydrophobic residues (Laude and Masters, 1995). 

The main activity of N protein is to bind to the viral RNA. Unlike the 
helical nucleocapsids of nonsegmented negative-strand RNA viruses, 
coronavirus ribonucleoprotein complexes are quite sensitive to the 
action of ribonucleases (Macnaughton et al., 1978). A significant por- 
tion of the stability of the nucleocapsid may derive from N—N monomer 
interactions (Narayanan et al., 2003b). Both sequence-specific and 
nonspecific modes of RNA binding by N have been assayed in vitro 
(Chen et al., 2005; Cologna et al., 2000; Masters, 1992; Molenkamp and 
Spaan, 1997; Nelson and Stohlman, 1993; Nelson et al., 2000; Robbins 
et al., 1986; Stohlman et al., 1988; Zhou et al., 1996). Specific RNA 
substrates that have been identified for N protein include the positive- 
sense transcription regulating sequence (Chen et al., 2005; Nelson 
et al., 2000; Stohlman et al., 1988), regions of the 3’ UTR (Zhou et al., 
1996) and the N gene (Cologna et al., 2000), and the genomic RNA 
packaging signal (Cologna et al., 2000; Molenkamp and Spaan, 1997) 
(Section IV.C). The RNA-binding capability of the MHV N protein has 
been mapped to domain 2 of this molecule (Masters, 1992; Nelson and 
Stohlman, 1993). However, for IBV, two separate RNA-binding sites 
have been found to map, respectively, to amino- and carboxy-terminal 
fragments of N protein (Zhou and Collisson, 2000), and RNA-binding 
activity has been reported for a fragment of the SARS-CoV N protein 
containing parts of domains 1 and 2 (Huang et al., 2004b). 

N is a phosphoprotein, as has been shown for MHV, IBV, BCoV, TGEV, 
and SARS-CoV (Calvo et al., 2005; King and Brian, 1982; Lomniczi 
and Morser, 1981; Stohlman and Lai, 1979; Zakhartchouk et al., 
2005). For MHV N, phosphorylation occurs exclusively on serine resi- 
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dues (Siddell et al., 1981; Stohlman and Lai, 1979), but in IBV N a 
phosphothreonine residue was also found (Chen et al., 2005). Kinetic 
analysis has shown that MHV N protein acquires phosphates rapidly 
following its synthesis (Siddell et al., 1981; Stohlman et al., 1983), and 
phosphorylation may lead to the association of N with intracellular 
membranes (Calvo et al., 2005; Stohlman et al., 1983). Although some 
15% of the amino acids of coronavirus N proteins are candidate phos- 
phoacceptor serines and threonines, phosphorylation appears to be 
targeted to a small subset of residues. For MHV, this was concluded 
both from the degree of charge heterogeneity of N protein observed in 
two-dimensional gel electrophoresis and from the limited number of 
tryptic phosphopeptides of N that could be separated by HPLC (Bond 
et al., 1979; Wilbur et al., 1986). Mass spectrometry has been employed 
to map the sites of phosphorylation of the IBVand TGEV N proteins. For 
IBV N, this was accomplished by comparison of unphosphorylated 
N protein expressed in bacteria with phosphorylated N protein ex- 
pressed in insect cells (Chen et al., 2005). Four sites of phosphorylation 
were found, two each in domains 2 and 3: Ser190, Ser192, Thr378, and 
Ser379. For TGEV N, purified virions and multiple fractions from in- 
fected cells were analyzed (Calvo et al., 2005). Here also, four sites of 
phosphorylation were found, one in domain 1 and three in domain 2: 
Ser9, Ser156, Ser254, and Ser256. In both of these analyses, the degree 
of sequence coverage achieved did not entirely rule out the possibility of 
additional, undetected phosphorylated residues in each of these 
N proteins. 

The role of N protein phosphorylation is currently unresolved, but 
this modification has long been speculated to have regulatory signifi- 
cance. In vitro binding evidence has been presented that phosphory- 
lated IBV N is better able to distinguish between viral and nonviral 
RNA substrates than is nonphosphorylated N (Chen et al., 2005). 
Possibly related to this result is the early conclusion, inferred from 
the differential accessibilities of some monoclonal antibodies, that 
phosphorylation induces a conformational change in the MHV N pro- 
tein (Stohlman et al., 1983). It has also been found that only a subset 
of the intracellular phosphorylated forms of BCoV N protein are 
incorporated into virions, suggesting that phosphorylation is linked 
to virion assembly and maturation (Hogue, 1995). The recent mapping 
of at least some of the N phosphorylation sites in some coronaviruses 
has now laid the groundwork for testing of the hypothetical functions 
of phosphorylation by reverse genetic methods. 

A number of potential activities, other than its structural role in the 
virion, have been put forward for N protein. Based on the specific 
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binding of N protein to the transcription-regulating sequence within 
the leader RNA, it has been proposed that N participates in viral 
transcription (Baric et al., 1988; Choi et al., 2002; Stohlman et al., 
1988). However, an engineered HCoV-229E replicon RNA that was 
devoid of the N gene and all other structural protein genes retained 
the capability to synthesize subgenomic RNA (Thiel et al., 2001b). 
Thus, if N protein does function in transcription, it must be in a 
modulatory, but not essential, capacity. Likewise, the binding of N 
protein to leader RNA has been implicated as a means for preferential 
translation of viral mRNAs (Tahara et al., 1994, 1998), although data 
supporting this attractive hypothesis are, as yet, incomplete. N protein 
has also been found to enhance the efficiency of replication of replicon 
or genomic RNA in reverse genetic systems in which infections are 
initiated from engineered viral RNA (Almazan et al., 2004; Schelle 
et al., 2005; Thiel et al., 2001a; Yount et al., 2002). This may be indica- 
tive of a direct role of N in RNA replication, but it remains possible that 
the enhancement actually results from the sustained translation of a 
limiting replicase component. 

Finally, it was shown that, in addition to its presence in the cyto- 
plasm, IBV N protein localized to the nucleoli of about 10% of cells that 
were infected with IBV or were independently expressing N protein 
(Hiscox et al., 2001). This observation was extended to the N proteins 
of MHV and TGEV, suggesting that nucleolar localization is a general 
feature of all three coronavirus groups. Such localization was proposed 
to correlate with the arrest of cell division (Wurm et al., 2001). Addi- 
tionally, both MHV and IBV N proteins were found to bind to two 
nucleolar proteins, fibrillarin and nucleolin (Chen et al., 2002). It must 
be noted, however, that nucleolar localization of N was not observed in 
TGEV-infected or SARS-CoV-infected cells by other groups of investi- 
gators (Calvo et al., 2005; Rowland et al., 2005). All steps of corona- 
virus replication are thought to occur outside of the nucleus. For MHV, 
it was shown some time ago that viral replication could occur in 
enucleated cells or in cells treated with actinomycin D or a-amanitin, 
host RNA polymerase inhibitors (Brayton et al., 1981; Wilhelmsen 
et al., 1981). By contrast, other studies reported that similar conditions 
reduced the growth yield of IBV, HCoV-229E, or FCoV (Evans and 
Simpson, 1980; Kennedy and Johnson-Lussenburg, 1979; Lewis 
et al., 1992). Even if coronavirus replication does not have an absolute 
dependence on the nucleus, the possibility remains that some viruses 
can alter host nuclear functions so as to create an environment more 
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favorable for viral infection. Such a modification might be brought 
about through the nuclear trafficking of one or more viral components. 


F. Genome 


The genomes of coronaviruses are nonsegmented, single-stranded 
RNA molecules of positive sense, that is, the same sense as mRNA 
(Fig. 4) (Lai and Stohlman, 1978; Lomniczi and Kennedy, 1977; 
Schochetman et al., 1977; Wege et al., 1978). Structurally they resem- 
ble most eukaryotic mRNAs, in having both 5’ caps (Lai and Stohlman, 
1981) and 3’ poly(A) tails (Lai and Stohlman, 1978; Lomniczi, 1977; 
Schochetman et al., 1977; Wege et al., 1978). Unlike most eukaryotic 
mRNAs, coronavirus genomes are extremely large—nearly three times 
the size of alphavirus and flavivirus genomes and four times the size of 
picornavirus genomes. Indeed, at lengths ranging from 27.3 (HCoV- 
229E) to 31.3 kb (MHV), coronavirus genomes are among the largest 
mature RNA molecules known to biology. Again, unlike most eukary- 
otic mRNAs, coronavirus genomes contain multiple ORFs. The genes 
for the four canonical structural proteins discussed previously account 
for less than one-third of the coding capacity of the genome and are 
clustered at the 3’ end. A single gene, which encodes the viral repli- 
case, occupies the 5’-most two-thirds of the genome. The invariant 
gene order in all members of the coronavirus family is 5/-replicase- 
S-E-M-N-3’. However, engineered rearrangement of the gene order of 
MHV was found to be completely tolerated by the virus (de Haan et al., 
2002b). This implies that the native order, although it became fixed 











Fic 4. Coronavirus genomic organization. The layout of the MHV genome is shown as 
an example. All coronavirus genomes have a 5’ cap and 3’ poly(A) tail. The invariant 
order of the canonical genes is replicase-S-E-M-N. The replicase contains two ORFs, la 
and 1b, complete expression of which is accomplished via ribosomal frameshifting. 
Accessory proteins (2a, HE, 4, 5a, and I, in the case of MHV) occur at various positions 
among the canonical genes. 
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early in the evolution of the family, is not functionally essential. At the 
termini of the genome are a 5’ UTR, ranging from 210 to 530 nucleo- 
tides, and a 3’ UTR, ranging from 270 to 500 nucleotides. The noncod- 
ing regions between the ORFs are generally quite small; in some cases, 
there is a small overlap between adjacent ORFs. Additionally, one or 
a number of accessory genes are intercalated among the structural 
protein genes. 

In common with almost all other positive-sense RNA viruses, the 
genomic RNA of coronaviruses is infectious when transfected into 
permissive host cells, as was originally shown for TGEV (Norman 
et al., 1968), IBV (Lomniczi, 1977; Schochetman et al., 1977), and 
MHV (Wege et al., 1978). The genome has multiple functions during 
infection. It acts initially as an mRNA that is translated into the huge 
replicase polyprotein, the complete synthesis of which requires a ribo- 
somal frameshifting event (Section V.C.1). The replicase is the only 
translation product derived from the genome; all downstream ORFs 
are expressed from subgenomic RNAs. The genome next serves as the 
template for replication and transcription (Section V). Finally, the 
genome plays a role in assembly, as progeny genomes are incorporated 
into progeny virions (Section IV.C). 


G. Accessory Proteins 


Interspersed among the set of canonical genes, replicase, S, E, M, 
and N, all coronavirus genomes contain additional ORFs, in a wide 
range of configurations. As shown in Table II, these “extra” genes can 
fall in any of the genomic intervals among the canonical genes and can 
vary from as few as one (PEDV and HCoV-NL63) to as many as eight 
genes (SARS-CoV). In some cases, accessory genes can be entirely 
embedded in another ORF, as the internal (I) gene found within the 
N gene of many group 2 coronviruses (Fischer et al., 1997a; Lapps 
et al., 1987; Senanayake et al., 1992), or they can be extensively over- 
lapped with another gene, as the 3b gene of SARS-CoV. In addition, 
many accessory genes do not constitute the 5’-most ORF in the largest 
subgenomic RNA in which they appear, and they therefore must re- 
quire nonstandard translation mechanisms for their expression (Liu 
et al., 1991). Intracellular expression has been demonstrated for a 
number of accessory proteins, but for many others it is at present 
merely speculative. 

The coronavirus accessory genes were originally labeled nonstruc- 
tural, but this is not entirely apt, since the products of some of them, 
the group 2 HE protein, the I protein (Fischer et al., 1997a), and the 
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TABLE II 
Coronavirus AccEssorY PROTEINS 
Group Virus species Accessory genes (Proteins)* 
1 TGEV [rep] - [S] - 3a, 3b - [E] - [M] - [N] - 7 
FIPV [rep] - [S] - 3a, 3b, 3c - [E] - [M] - [N] - 7a, 7b 
HCoV-229E [rep] - [S] - 4a, 4b - [E] - [M] - [N] 
PEDV [rep] - [S] - 3 - [E] - [M] - [N] 
HCoV-NL63 [rep] - [S] - 3 - [E] - [M] - [N] 
2 MHV [rep] - 2a, 2b(HE) - [S] - 4 - 5a, [E] - [M] - [IN], 7b(1) 
BCoV [rep] - 2a - 2b(HE) - [S] - 4a(4.9k), 
4b(4.8k) - 5(12.7k) [E] - [M] - [N], Tb 
HCoV-0C43 [rep] - 2a - 2b(HE) - [S] - 5(12.9k) - [E] - [M] - [N], 7b(D) 
SARS-CoV [rep] - [S] - 3a, 3b - [E] - [M] - 6 - 7a, 7b - 8a, 8b - [N], 9b(D) 
HCoV-HKU1 [rep] - 2(HE) - [S] - 4 - [E] - [M] - [IN], 7b(1) 
Bat-SARS-CoV [rep] - [S] - 3 - [E] - [M] - 6 - 7a, 7b - 8 - [N], 9b(1) 
3 IBV [rep] - [S] - 3a, 3b, 3c - [E] - [M] - 5a, 5b - [N] 








* Accessory genes and proteins are listed only for coronaviruses for which a complete 
genomic sequence is available. The protein product is indicated in parentheses in cases 
where it has a different designation than the gene. Products of separate transcripts are 
separated by hyphens; the transcription of accessory genes may vary among different 
strains of the same virus species (O’Connor and Brian, 1999). The canonical coronavirus 
genes are indicated in brackets; rep denotes replicase. 


SARS-CoV 3a protein, have been shown to be components of virions. 
Accessory genes were also previously called group-specific genes, but 
this appellation has become a misnomer in light of the diversity re- 
vealed by recently discovered coronaviruses. In general, accessory 
genes are numbered according to the subgenomic RNA in whose 
unique region they appear, but this nomenclature system is sometimes 
overridden by historical precedent. As a result, identically numbered 
genes in two different viruses, for example, the 5a genes of MHV 
and IBV, do not necessarily occupy the same genomic position. Like- 
wise, two identically numbered genes, for example, the 3a genes of 
SARS-CoV and TGEV, do not necessarily have any sequence homology. 

It is often speculated that the coronavirus accessory genes were 
horizontally acquired from cellular or heterologous viral sources, but 
only in two cases, the group 2 HE and 2a genes, is there good evidence 
for this proposal. HE, the most clear-cut example, is discussed 
later. A possible function for the 2a protein has been inferred from a 
bioinformatics analysis, which places it in a very large family of cellu- 
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lar and viral 2’,3’-cyclic phosphodiesterases (Mazumder et al., 2002). 
Besides its presence in some group 2 coronaviruses, this gene also 
appears in another family within the Nidovirales order, the toro- 
viruses (Snijder et al., 1990). Curiously, in the toroviruses, the 2a 
homolog is situated as a module within the replicase polyprotein, 
suggesting either that it was acquired independently or that there 
was nonhomologous recombination between ancestors of viruses within 
the two families (Snijder et al., 1991). However, most accessory gene 
ORFs have no obvious homology to any other viral or cellular sequence 
in public databases. It is conceivable that many of them evolved in 
individual coronaviruses by the scavenging of ORFs from the virus’s 
own genome, through duplication and subsequent mutation, as has 
been proposed for several of the accessory proteins of SARS-CoV 
(Inberg and Linial, 2004). It is tempting to regard this as a possible 
origin for the SARS-CoV 3a protein, which has a topology and size 
remarkably similar to that of the M protein, although there is no 
sequence similarity between the two. Such a relationship would paral- 
lel that in the arteriviruses, another Nidovirales family, in which 
the major envelope glycoprotein is also a triple-spanning membrane 
protein and forms heterodimers with its M protein (Snijder and 
Meulenberg, 1998). 

It also needs to be considered that, although there is evidence that 
some accessory genes encode “luxury” functions for their respective 
viruses, other accessory genes may be genetic junk. Many isolates of 
IBV contain an extremely diverged segment of some 200 nucleotides 
between the N gene and the 3’ UTR (Sapats et al., 1996). This was 
long considered to be a hypervariable region of the 3’ UTR, although it 
was shown to be dispensable for RNA synthesis (Dalton et al., 2001). 
Intriguingly, coronavirus sequences closely related to IBV have been 
characterized in pigeons and geese. These sequences have one and 
two additional ORFs, respectively, between the N gene and the 
3’ UTR (Jonassen et al., 2005). This finding suggests that the IBV 
hypervariable region and the PCoV ORF are degenerate remnants of 
a precursor retained in the GCoV sequence. The two GCoV ORFs, in 
turn, may be vestiges of one or more functional ancestral genes, or they 
may be derived from horizontally acquired sequences that there has 
been no selective pressure to eliminate. A similar situation probably 
pertains for the SARS-CoV 8a and 8b genes. Isolates of SARS-CoV 
from marketplace animals near the source of the epidemic were found 
to contain an additional 29 nucleotides absent from all but one previ- 
ously reported human isolate, and this apparent insertion resulted in 
the fusion of ORFs 8a and 8b into a single ORF 8 (Guan et al., 2003). 
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One scenario consistent with this observation is that loss of the 29-nt 
sequence was concomitant with the jump of the virus from animals to 
humans, although the functional significance of this loss, if any, is not 
yet clear. 

In all cases examined, through natural or engineered mutants, ac- 
cessory protein genes have been found to be nonessential for viral 
replication in tissue culture. This dispensability has been determined 
for the 2a and HE genes of MHV (de Haan et al., 2002a; Schwarz et al., 
1990), genes 4 and 5a of MHV (de Haan et al., 2002a; Weiss et al., 1993; 
Yokomori and Lai, 1991), the J gene of MHV (Fischer et al., 1997a), 
gene 7 of TGEV (Ortego et al., 2003), genes 7a and 7b of FIPV (Haijema 
et al., 2003, 2004), and genes 5a and 5b of IBV (Casais et al., 2005; 
Youn et al., 2005). Similarly, some accessory protein genes do not seem 
to play any role in infection of the natural host. For gene 4 (Ontiveros 
et al., 2001) and the I gene (Fischer et al., 1997a) of MHV, and for gene 
7b of FIPV (Haijema et al., 2003), selective knockout produced no 
detectable effect on pathogenesis in mice or cats, respectively. By 
contrast, disruption of gene 7 of TGEV greatly reduced viral replica- 
tion in the lung and gut of infected piglets (Ortego et al., 2003). In the 
same manner, viruses with knockouts of either the 3abc gene cluster or 
genes 7a and 7b in FIPV produced no clinical symptoms in cats at 
doses that were fatal with wild-type virus (Haijema et al., 2004). The 
deletion of genes 2a and HE, or of genes 4 and 5a, in MHV completely 
abrogated the lethality of intracranial infection in mice (de Haan et al., 
2002a). Even a single point mutation in MHV ORF 2a, which had 
no effect in tissue culture, was found to greatly attenuate virulence 
in vivo (Sperry et al., 2005). In a study that took the opposite approach 
to assessing accessory protein function, it was discovered that engi- 
neered insertion of gene 6 of SARS-CoV greatly enhanced the virulence 
of an attenuated variant of MHV (Pewe et al., 2005). 

The most extensively characterized accessory protein is HE (formerly 
called E3), which is a fourth constituent of the membrane envelope in 
many group 2 coronaviruses (Brian et al., 1995). HE forms a second set 
of small spikes that appear as an understory among the tall S protein 
spikes. It was first identified as a hemagglutinin in HEV (Callebaut and 
Pensaert, 1980) and BCoV (King and Brian, 1982; King et al., 1985). The 
HE monomer has an N-exo, C-endo transmembrane topology, with an 
amino-terminal signal peptide, a large ectodomain, a transmembrane 
anchor, and a very short, carboxy-terminal endodomain. Monomers of 
HE, prior to glycosylation are 48 kDa; this size increases to 65 kDa after 
addition and processing of oligosaccharide, which is exclusively 
N-linked (Hogue et al., 1989; Kienzle et al., 1990; Yokomori et al., 
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1989). The mature protein is a homodimer that is stabilized by both 
intrachain and interchain disulfide bonds (Hogue et al., 1989). The 
hemagglutinating property of HE raised the possibility that, in the 
viruses in which it appears, this protein may duplicate or replace the 
role that is assigned to the coronavirus S protein. However, it has been 
shown, through the construction of MHV-BCoV chimeric viruses, that 
the BCoV HE protein, in the absence of BCoV S protein, is not sufficient 
for initiation of infection in tissue culture (Popova and Zhang, 2002). 

The HE protein also contains an acetylesterase activity. This was 
originally discovered in BCoV and HCoV-OC43, where it was shown to 
be similar to the receptor-binding and receptor-destroying activity 
found in influenza C virus (Vlasak et al., 1988a, b). The nature of the 
esterase enzyme has subsequently been comprehensively studied and 
compared among a number of group 2 coronaviruses (Klausegger et al., 
1999; Regl et al., 1999; Smits et al., 2005). HE proteins of BCoV, HCoV- 
OC43, ECoV, and MHV strain DVIM were found to be sialate-9-O- 
acetylesterases. By contrast, HE proteins of RCoV, and MHV strains 
S and JHM were found to be sialate-4-O-acetylesterases. Surprisingly, 
the coronavirus HE gene is clearly related to the influenza C virus HA1 
gene (Luytjes et al., 1988). Equally remarkably, toroviruses also pos- 
sess a homolog of the HE gene but at a different genomic locus than 
where it appears in the group 2 coronaviruses (Cornelissen et al., 
1997). This may be evidence of genetic trafficking among pairs of 
ancestors of these three viruses, as was originally proposed (Luytjes 
et al., 1988; Snijder et al., 1991). Alternatively, it may indicate that 
members of different virus families independently acquired the HE 
gene by horizontal transfer from cellular sources (Cornelissen et al., 
1997). 

There are two ways in which HE could act in coronavirus replication. 
It could serve as a cofactor for S, assisting attachment of virus to host 
cells. Additionally, it could prevent aggregation of progeny virions and 
travel of virus through the extracellular mucosa (Cornelissen et al., 
1997). The role of HE protein in coronavirus infection has been sys- 
tematically documented in a recent pair of elegant studies (Kazi et al., 
2005; Lissenberg et al., 2005). To evaluate the cost and benefit of the 
HE gene, three isogenic MHV mutants were engineered: HE+, with an 
expressed and functional HE gene; HE®, with an expressed HE gene 
that was inactive, owing to active site point mutations; and HE7, 
which lacked HE expression because of an introduced frameshift. It 
was demonstrated that, following multiple passages, there was rapid 
loss of HE expression in the HE* virus. Moreover, competition experi- 
ments showed a growth advantage for the HE” virus, but not the HE? 
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virus. Consistent with this, examination of esterase-negative mutants 
arising from the HE* virus showed that it was not loss of activity, but, 
rather, loss of the ability of HE to be incorporated into virions that 
correlated with the growth advantage of HE viruses (Lissenberg 
et al., 2005). By contrast, in infections of mice, it was found that the 
presence of HE (whether or not it was enzymatically active) dramati- 
cally enhanced neurovirulence, as measured by viral spread and le- 
thality (Kazi et al., 2005). These results imply that sialic acid—bearing 
coreceptors can function to influence the course of MHV infection. Thus, 
the HE protein is a burden in vitro but provides an advantage to the 
virus in vivo. The selection against HE in vitro provides a cautionary 
example that tissue culture adaptation of a virus can rapidly lead 
to selection of a variant that differs from the natural isolate. 


IV. ViraAt REPLICATION CYCLE AND VIRION ASSEMBLY 


Coronavirus infections are initiated by the binding of virions to cellu- 
lar receptors (Fig. 5). This sets off a series of events culminating in the 
deposition of the nucleocapsid into the cytoplasm, where the viral ge- 
nome becomes available for translation. The positive-sense genome, 
which also serves as the first mRNA of infection, is translated into the 
enormous replicase polyprotein. The replicase then uses the genome as 
the template for the synthesis, via negative-strand intermediates, of 
both progeny genomes and a set of subgenomic mRNAs. The latter 
are translated into structural proteins and accessory proteins. The 
membrane-bound structural proteins, M, S, and E, are inserted into 
the ER, from where they transit to the endoplasmic reticulum-Golgi 
intermediate compartment (ERGIC). Nucleocapsids are formed from 
the encapsidation of progeny genomes by N protein, and these coalesce 
with the membrane-bound components, forming virions by budding 
into the ERGIC. Finally, progeny virions are exported from infected 
cells by transport to the plasma membrane in smooth-walled vesicles, 
or Golgi sacs, that remain to be more clearly defined. During infection 
by some coronaviruses, but not others, a fraction of S protein that 
has not been assembled into virions ultimately reaches the plasma 
membrane. At the cell surface S protein can cause the fusion of an 
infected cell with adjacent, uninfected cells, leading to the formation 
of large, multinucleate syncytia. This enables the spread of infection 
independent of the action of extracellular virus, thereby providing some 
measure of escape from immune surveillance. Key aspects of 
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Fic 5. The coronavirus life cycle. 
























the coronavirus replication cycle are discussed in more detail in the 
remainder of this section and in the next section (Section V). 


A. Receptors and Entry 
1. Receptors 


The pairings of coronaviruses and their corresponding receptors are 
generally highly species specific, but the adaptation of SARS-CoV to 
the human population has reminded us that this allegiance is mutable. 
Well prior to the emergence of SARS, it was clearly documented that 
another coronavirus, BCoV, was capable of sporadic cross-species 
transmission (Saif, 2004). Viruses very closely related to BCoV had 
been isolated from wild ruminants (Tsunemitsu et al., 1995), domestic 
dogs (Erles et al., 2003), and, in one case, a human child (Zhang et al., 
1994). Nevertheless, the interaction between S protein and receptor 
remains the principal, if not sole, determinant of coronavirus host 
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species range and tissue tropism. At the cellular level, this has been 
demonstrated by manipulation of each of the interacting partners. 
First, expression of an identified receptor in nonpermissive cells, often 
of a heterologous species, invariably has rendered those cells permis- 
sive for the corresponding coronavirus (Delmas et al., 1992; Dveksler 
et al., 1991; Li et al., 2003, 2004; Mossel et al., 2005; Tresnan et al., 
1996; Yeager et al., 1992). Second, the engineered swapping of S pro- 
tein ectodomains has been shown to change the in vitro host cell 
species specificity of MHV to that of FIPV (Kuo et al., 2000) or, con- 
versely, of FIPV to that of MHV (Haijema et al., 2003). Similarly, 
exchange of the relevant regions of S protein ectodomains was shown 
to transform a strictly respiratory isolate of TGEV into a more virulent, 
enterotropic strain (Sanchez et al., 1999). Replacement of the S protein 
ectodomain of MHV strain A59 caused the virus to acquire the highly 
virulent neurotropism of MHV strain 4 (Phillips et al., 1999) or the 
highly virulent hepatotropism of MHV strain 2 (Navas et al., 2001). 
Table III lists the known cellular receptors for coronaviruses of 
groups 1 and 2; to date no receptors have been identified for corona- 
viruses of group 3. Group 2 coronavirus receptors include the earliest 
and the most recent of the items in Table III. The MHV receptor 
(formerly MHVR1, now called mCEACAM1) is a member of the carci- 
noembryonic antigen (CEA) family, a group of proteins within the 
immunoglobulin (Ig) superfamily. CEACAM1 was the first receptor 
discovered for a coronavirus, and, indeed, it was one of the first recep- 
tors found for any virus (Williams et al., 1990, 1991). Cloning of cDNA 
to the largest mRNA for this protein revealed that full-length CEA- 
CAM1 has four Ig-like domains (Dveksler et al., 1991), but a number of 
two- and four-domain versions of the molecule were later found to 
be expressed in mouse cells. This diversity of MHV receptor isoforms 
was found to be generated by multiple alleles of the Ceacam1 gene 
as well as by the existence of multiple alternative splicing variants 
of its mRNA (Compton, 1994; Dveksler et al., 1993a,b; Ohtsuka and 
Taguchi, 1997; Ohtsuka et al., 1996; Yokomori and Lai, 1992). The wide 
range of pathogenicity of MHV in mice is therefore thought to result 
from the interactions of S proteins of different virus strains with the 
tissue-specific spectra of receptor variants displayed in mice having 
different genetic backgrounds. A number of lines of evidence argue 
that CEACAM1 is the only biologically relevant receptor for MHV. 
This was initially suggested by an early experiment showing that 
in vivo administration of a monoclonal antibody to CEACAM1 greatly 
enhanced the frequency of survival of mice subsequently given a lethal 
challenge of MHV (Smith et al., 1991). More definitively, it was demon- 
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TABLE III 
CORONAVIRUS RECEPTORS 
Group Virus species Receptor Reference 
1 TGEV Porcine aminopeptidase N(pAPN) Delmas et al., 1992 
PRCoV Porcine aminopeptidase N(pAPN) Delmas et al., 1994b 
FIPV Feline aminopeptidase N (fAPN) Tresnan et al., 1996 
FCoV Feline aminopeptidase N (fAPN) Tresnan et al., 1996 
CCoV Canine aminopeptidase N (cAPN) Benbacer et al., 1997 
HCoV-229E Human aminopeptidase N(hAPN) Yeager et al., 1992 
HCoV-NL63 Angiotensin-converting enzyme Hofmann et al., 2005 
2 (ACE2) 
2 MHV Murine carcinoembryonic Nedellec et al., 1994*; 
antigen-related Williams et al., 1991 
adhesion molecules 1 and 2* 
(mCEACAM1, mCEACAM2*) 
BCoV 9-O-acetyl sialic acid Schultze et al., 1991 
SARS-CoV Angiotensin-converting enzyme Li et al., 2003 


2 (ACE2) 


CD209L (L-SIGN) Jeffers et al., 2004 


* The mCEACAM2 molecule functions as a weak MHV receptor in tissue culture but 
does not serve as an alternate receptor in vivo (Hemmila et al., 2004). 


strated that homozygous Ceacam1 knockout mice were totally resis- 
tant to infection by high doses of MHV (Hemmila et al., 2004). Thus, 
even though CEACAM2, the product of the other murine Ceacam 
gene family member, can function as a weak MHV receptor in tissue 
culture (Nedellec et al., 1994), it cannot be used as an alternative 
receptor in vivo. 

Initial studies of the structural requirements for CEACAM1 func- 
tion showed that the molecule must be glycosylated in order to be 
functional as an MHV receptor (Pensiero et al., 1992). Moreover, the 
amino-terminal Ig-like domain was found to be the part of the molecule 
that is bound both by MHV S protein and by the monoclonal antibody 
originally used to identify the receptor (Dveksler et al., 1993b). The 
essential difference between high-affinity and low-affinity S binding 
receptor alleles has been mapped to a determinant as small as 
six amino acid residues on the amino-terminal domain (Rao et al., 
1997; Wessner et al., 1998). These critical residues, it turns out, 
fall within a prominent, uniquely convoluted loop in the recently 
solved x-ray crystallographic structure for a two-Ig-domain isoform 
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of CEACAM1 (Tan et al., 2002). Notably, this loop was found to 
be topologically similar to protruding loops of the virus-binding do- 
mains of the receptors for rhinoviruses, HIV, and measles, all of which, 
like CEACAM1, are cell adhesion molecules. The CEACAM1 struc- 
ture now provides the basis for beginning to understand the relative 
affinities of receptor variants for different S protein ligands. 

Other group 2 coronaviruses use different receptors. The rat coro- 
naviruses RCoV and SDAV, although closely related to MHV and able 
to grow in some of the same cell lines as does MHV, do not gain entry to 
cells via mCEACAM1. Anti-CEACAM1 monoclonal antibody, which 
totally blocks MHV infection, was shown to have no effect on infection 
by rat coronaviruses; moreover, expression of mCEACAM1 in nonper- 
missive BHK cells rendered them susceptible to MHV but not to rat 
coronaviruses (Gagneten et al., 1996). BCoV is phylogenetically close 
to MHV, but the two viruses neither share common hosts nor are they 
supported by any of the same cell lines in tissue culture. To date, the 
only identified cell attachment factor for BCoV is 9-O-acetyl sialic acid 
(Schultze et al., 1991), but it is not yet clear whether this moiety must 
be linked to specific proteins or glycolipids or whether there is also a 
specific cellular protein receptor for BCoV. 

Not surprisingly, SARS-CoV, which is phylogenetically most distant 
from all other group 2 coronaviruses, uses a receptor wholly unrelated 
to CEACAMs. The SARS-CoV receptor, which was found in remarkably 
short order after the discovery of the virus, is angiotensin-converting 
enzyme 2 (ACE2). This was identified through the use of a SARS-CoV 
S1-IgG fusion protein to immunoprecipitate membrane proteins from 
Vero E6 cells, an African green monkey kidney cell line that is the best 
in vitro host for SARS-CoV (Li et al., 2003). Binding of S1-IgG to Vero 
E6 cells was inhibited by soluble ACE2 protein but not by a related 
protein, ACE1. Expression of cloned cDNA for ACE2 was then shown 
to render nonpermissive cells susceptible to infection by SARS-CoV 
(Li et al., 2003). ACE2 was also identified by expression cloning of 
an S1-binding activity, and it was shown to render cells infectable 
by a retroviral pseudotype carrying the SARS-CoV S protein (Wang 
et al., 2004). 

ACE2 is a zinc-binding carboxypeptidase that is involved in regula- 
tion of heart function. It is an N-exo, C-endo transmembrane glyco- 
protein with a broad tissue distribution. Active-site mutants of 
ACE2 showed no detectable defects in binding to SARS-CoV S protein 
(Moore et al., 2004) or in promoting S protein-mediated syncytia for- 
mation (Li et al., 2003), suggesting that ACE2 catalytic activity is not 
required for receptor function. This conclusion needs to be verified by 
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direct SARS-CoV infection, however. Recently solved x-ray structures 
for ACE2 have revealed that a large conformational change is induced 
by the binding of an inhibitor in the active site of the enzyme (Towler 
et al., 2004). Although this finding raised the possibility of a means to 
interfere with the initiation of infection, the inhibitor does not affect 
S protein binding or receptor function of ACE2 (Li et al., 2005a). 

Numerous cell lines from a range of species have been classified with 
respect to their permissivity or nonpermissivity to SARS-CoV (Gillim- 
Ross et al., 2004; Giroglou et al., 2004; Mossel et al., 2005), thereby 
allowing inferences as to which species homologs of ACE2 could have 
some degree of SARS-CoV receptor activity. In direct tests of S1 bind- 
ing, human ACE2 was shown to be a much better receptor than was 
mouse ACE2; the receptor activity of rat ACE2, however, was barely 
detectable above background (Li et al., 2004). In all cases tested, 
nonpermissive cells were shown to be made permissive by expression 
of human ACE2 (Mossel et al., 2005). The full picture of factors influ- 
encing SARS-CoV host and tissue tropism is still developing. Human 
CD209L (also called L-SIGN or DC-SIGNR), a lectin family member, 
has been found to act as a second receptor for SARS-CoV, but it has 
much lower efficiency than does ACE2 (Jeffers et al., 2004). A related 
lectin, DC-SIGN, was identified as a coreceptor, since it was able to 
transfer the virus from dendritic cells to susceptible cells; DC-SIGN 
could not act as receptor on its own, however (Marzi et al., 2004; Yang 
et al., 2004). 

Many group 1 coronaviruses use the aminopeptidase N (APN) of 
their cognate species as a receptor (Table III) (Delmas et al., 1992; 
Tresnan et al., 1996; Yeager et al., 1992). APN (also called CD13) is a 
cell-surface, zinc-binding protease that contributes to the digestion of 
small peptides in respiratory and enteric epithelia; it is also found in 
human neural tissue that is susceptible to HCoV-229E (Lachance 
et al., 1998). The APN molecule is a homodimer; each monomer has a 
C-exo, N-endo membrane orientation and is heavily glycosylated. Com- 
petition experiments with monoclonal antibodies suggested that there 
is some overlap between the catalytic domain of hAPN and the binding 
site for HCoV-229E (Yeager et al., 1992). However, neither the use of 
specific APN inhibitors, nor the mutational disruption of the catalytic 
site of pAPN, affected its TGEV receptor activity, indicating that 
the enzymatic activity of APN, per se, is not required for initiation of 
infection (Delmas et al., 1994a). In general, the receptor activities 
of APN homologs are not interchangeable: hAPN cannot act as a receptor 
for TGEV (Delmas et al., 1994a), and pAPN cannot act as a receptor for 
HCoV229E (Kolb etal., 1996). Curiously, fAPN can serve as a receptor not 
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only for FIPV but also for CCoV, TGEV, and HCoV-229E (Tresnan et al., 
1996). These contrasting properties have been used as the framework for 
dissecting the basis of species-specific or -nonspecific function, through 
the construction and analysis of chimeric receptors (Benbacer et al., 
1997; Delmas et al., 1994a; Hegyi and Kolb, 1998; Kolb et al., 1996, 
1997). However, chimera construction has not revealed a single linear 
determinant for virus binding. Rather, two different regions of the mole- 
cule have been found to influence receptor activity with respect to a given 
coronavirus. A detailed study of one of these regions showed that the 
critical characteristic in chimeras that exclude HCoV-229E is a particu- 
lar glycosylation site. HCoV-229E likely does not directly bind to this 
region of APN, but it is hindered from doing so in homologs that are 
glycosylated at this locus (Wentworth and Holmes, 2001). 

Not all group 1 coronaviruses use APN as a receptor, however. It has 
been proposed that one subset of FIPV strains uses a different recep- 
tor, since an antibody to fAPN blocked replication of type II strains of 
FIPV but not replication of type I strains of FIPV (Hohdatsu et al., 
1998). This conclusion is consistent with the observation that there is 
greater sequence divergence between type I FIPV S proteins and type 
II FIPV S proteins than there is between type II FIPV S proteins and 
the S proteins of CCoV or TGEV (Herrewegh et al., 1998; Motokawa 
et al., 1996). Likewise, although it has been suggested that pAPN can 
facilitate cellular entry of PEDV (Oh et al., 2003), the major receptor 
for PEDV probably differs from that for TGEV, since the two viruses 
are able to grow in mutually exclusive sets of cells lines derived from 
different species (Hofmann and Wyler, 1988). The most outstanding 
exception to the generality of APN as a receptor for group 1 corona- 
viruses is the discovery that HCoV-NL63 cannot use hAPN to initiate 
infection; instead it is able to employ the same receptor as SARS-CoV, 
namely ACE2 (Hofmann et al., 2005). This finding raises very interest- 
ing questions, one of which is why HCoV-NL63 causes a much milder 
respiratory disease than does SARS-CoV. Another is why two very 
different, zinc-binding, cell-surface peptidases, APN and ACE2, should 
serve as receptors for such a substantial number of coronaviruses. This 
situation can currently be ascribed to an amazing coincidence, but it 
may later be found to have deeper significance. 


2. Receptor Recognition 


The more variable of the two portions of the spike molecule, S1, is 
the part that binds to the receptor. Binding leads to a conformational 
change that results in the more highly conserved portion of the spike 
molecule, S2, mediating fusion between virion and cell membranes. 
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Just as different coronaviruses can bind to different receptors, corona- 
viruses also appear to use different regions of S1 with which to do so. 
Receptor-binding domains (RBDs) have so far been mapped in four 
S proteins (Fig. 2). In the group 1 coronavirus TGEV, the RBD was 
localized to amino acids 579-655, a region highly conserved among the 
S proteins of TGEV, PRCoV, FIPV, FCoV, and CCoV (Godet et al., 
1994). For the more distantly related group 1 coronavirus HCoV- 
229E, the RBD was found to fall in an adjacent, nonoverlapping 
segment of S1, amino acids 417-547 (Bonavia et al., 2003). By contrast, 
the RBD of MHV was localized to the amino terminus of the S mole- 
cule, amino acids 1-330 (Kubo et al., 1994; Suzuki and Taguchi, 1996; 
Taguchi, 1995). Finally, the RBD of SARS-CoV was mapped to amino 
acids 270-510 or 303-537 by binding of S protein fragments to Vero 
cells (Babcock et al., 2004; Xiao et al., 2003). These loci were contained 
within a domain shown to harbor the epitope for a neutralizing single- 
chain antibody fragment that blocked S1 association with the ACE2 
receptor (Sui et al., 2004). The SARS-CoV RBD was more finely delim- 
ited, to amino acids 318-510, by analysis of the binding to ACE2 of a 
large set of S1 constructs (Wong et al., 2004). Thus, on a linear map of 
S proteins aligned principally by their S2 domains, the MHV RBD falls 
near the amino end of $1, the SARS-CoV RBD is in the middle of $1, 
and the TGEV and HCoV-229E RBDs fall near the carboxyl end of S1. 
The complementarity of the MHV and TGEV RBD loci is further 
emphasized by the fact that substantial deletions are tolerated in 
TGEV S1 in the region that corresponds to the MHV RBD (Laude 
et al., 1995). Conversely, substantial deletions are tolerated in MHV 
S1 in the region that corresponds to the TGEV RBD (Parker et al., 
1989; Rowe et al., 1997). 

For MHV, persistent infection in tissue culture was shown to lead to 
the selection of variant viruses with an extended host range (Baric 
et al., 1997, 1999; Schickli et al., 1997). These viruses gained the ability 
to grow in cell lines from numerous species not permissive to wild-type 
MHV through an acquired recognition of receptors other than CEA- 
CAM1. Analysis and engineered reconstruction of one of these selected 
variants showed that a relatively small number of amino acid changes 
in the S protein RBD accounted for its extended host range (Schickli 
et al., 2004; Thackray and Holmes, 2004). Comparison of the RBDs of 
various strains of MHV, of the extended host range mutant of MHYV, 
and of other group 2 coronaviruses allowed the identification of five 
residues in the RBD that were uniquely conserved among MHV 
strains (Thackray et al., 2005). Mutations in some of these residues 
were lethal or resulted in viruses that formed very small plaques; in 
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particular, a tyrosine at position 162 of the RBD was proposed as a 
candidate element in a key interaction with the receptor. 

A set of elegant studies with the SARS-CoV S protein and ACE2 has 
provided the most detailed image of RBD-receptor interactions yet 
available for any coronavirus. Aided by the x-ray structure of ACE2, 
Li et al. (2005a) used the rat ACE2 molecule, which has negligible 
receptor activity, as a scaffold to identify critical residues in human 
ACE2. Transfer of as few as four human ACE 2 residues to rat ACE2 
enabled the latter to bind S protein almost as well as human ACE2 did. 
A similar approach was used to determine key S1 residue changes that 
allowed the interspecies jump of SARS-CoV. The S1 domains of two 
SARS-CoV isolates were compared in this analysis: one (TOR2) from 
the main 2002-2003 SARS outbreak, and one (GD) from the 
subsequent 2003-2004 outbreak; the latter outbreak was much less 
severe and did not include any human-to-human transmission. Both 
the TOR2 and GD viruses are thought to have been transmitted to 
humans from palm civets, the final intermediary host in the jump of 
SARS-CoV from an unknown natural reservoir. However, only the 
TOR2 virus efficiently adapted to humans. Correspondingly, it was 
found that the S1 domains of both the TOR2 and GD viruses bound 
to palm civet ACE2, but only TOR2 S1 bound to human ACE2 (Li etal., 
2005a). Binding experiments with numerous chimeric variants were 
used to chart precisely which of the multiple coordinated changes in 
both the S1 RBD and in the human and palm-civet ACE2 could account 
for differences in the mutual affinities of the two molecules. The basis 
for the results that were obtained was then deduced from the x-ray 
structure of human ACE2 in a complex with the SARS-CoV S protein 
RBD (Li et al., 2005b). The RBD was found to bind to the amino- 
terminal, catalytic domain of ACE2, contacting the latter with a 
concave, 71-residue loop. Inspection of the interface of this contact 
revealed that an astonishingly small number of RBD amino acid 
changes were critical to the adaptation of the virus from one species 
homolog of ACE2 to another. A change as subtle as the gain of a methyl 
group (serine to threonine at residue 487 of the RBD) that fits into a 
hydrophobic pocket on the receptor could account for a 20-fold increase 
in affinity of S1 for human ACE2. 


3. S Protein Conformational Change and Fusion 


The binding of spike to its cellular receptor triggers a major confor- 
mational change in the S molecule. In some cases, induction of this 
conformational change may also require a shift to an acidic pH. Thus, 
some coronaviruses, such as MHV, fuse with the plasma membrane at 
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the cell surface (Sturman et al., 1990; Weismiller et al., 1990), while 
others, such as TGEV (Hansen et al., 1998), HCoV-229E (Nomura 
et al., 2004), and SARS-CoV (Hofmann et al., 2004; Simmons et al., 
2004; Yang et al., 2004), appear to enter the cell via receptor-mediated 
endocytosis and then fuse with the membranes of acidified endosomes. 
There may be a very fine balance between these two states. For MHV, 
it was found that as few as three amino acid changes in a heptad repeat 
region in S2 could govern the switch from plasma membrane fusion to 
strictly acid pH-dependent fusion (Gallagher et al., 1991; Nash and 
Buchmeier, 1997). For SARS-CoV, protease treatment of cells at the 
earliest steps of infection was found to allow the virus to enter 
cells from the surface, rather than through an endocytic pathway 
(Matsuyama et al., 2005). Such treatment enhanced the infectivity of 
the virus by orders of magnitude, and this enhancement was receptor 
dependent. Although SARS-CoV S protein is not detectably cleaved 
in virions or pseudovirions produced in tissue culture (Simmons 
et al., 2004; Song et al., 2004), protease treatment may mimic the 
environment resulting from an inflammatory response in infected 
lungs. 

Much of the characterization of the receptor-induced conformational 
change in S was initially carried out with the MHV S protein, for which 
it was found that the effects of receptor binding could also be elicited by 
treatment of virions at mild alkaline pH (Sturman et al., 1990). Such 
treatment caused the dissociation and release of the cleaved S1 sub- 
unit and the aggregation of S2 subunits; the accompanying conforma- 
tional changes in S1 were monitored by differential access of a panel of 
monoclonal antibodies at neutral and alkaline pH (Weismiller et al., 
1990). Disulfide bond formation plays an important role in S protein 
folding, and disulfides in S1 may become rearranged during the con- 
formational transitions of S1 following receptor binding (Lewicki 
and Gallagher, 2002; Opstelten et al., 1993; Sturman et al., 1990). 
The S protein of the highly virulent MHV strain 4 (JHM) has 
been shown to exist in a particularly metastable configuration. This 
results in a hair-trigger spike so highly fusogenic that it can mediate 
fusion between infected cells and cells lacking receptors, thereby 
leading to more extensive neuropathogenesis than occurs with other 
MHV strains (Gallagher and Buchmeier, 2001; Gallagher et al., 1992; 
Krueger et al., 2001; Nash and Buchmeier, 1996). 

In the normal spike-receptor interaction, both the S1-binding and 
the S1-activation functions were found to reside in the amino-terminal 
Ig domain of CEACAM1 (Miura et al., 2004). The role of the additional 
Ig domain(s) in the various CEACAM isoforms is apparently to give the 
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virus access to the amino-terminal Ig domain. Similarly, although the 
RBD of the MHV S protein lies near the amino terminus of S1, portions 
of the molecule distal to this site can significantly influence the stability 
of the Sl-receptor interaction (Gallagher, 1997). The conformational 
change that separates S1 from the rest of the molecule, in turn, trans- 
mits a major change to S2. This secondary change has been monitored 
by the differential susceptibility of S2 to protease treatment before and 
after the binding of S1 to soluble receptor (Matsuyama and Taguchi, 
2002). Additionally, the same changes were shown to be caused, in the 
absence of receptor, by mild alkaline pH, which induced a fusogenic 
state in S2 that could be measured by a liposome flotation assay (Zelus 
et al., 2003). 

It has been realized that the coronavirus S protein is a type I viral 
fusion protein with functional similarities to the fusion proteins of 
phylogenetically distant RNA viruses such as influenza virus, HIV, 
and Ebola virus (Bosch et al., 2003). Similar to its counterparts in 
other viruses, the coronavirus S2 domain contains two separated hep- 
tad repeats, HR1 and HR2, with a fusion peptide upstream of HR1 and 
the transmembrane domain immediately downstream of HR2 (Fig. 2). 
Mutations in the MHV S protein HR1 and HR2 regions were shown to 
inhibit or abolish fusion (Luo and Weiss, 1998; Luo et al., 1999). Unlike 
its counterparts, however, the coronavirus S protein does not require 
cleavage to be fusogenic, and it contains an internal fusion peptide, 
although the exact assignment of this domain is not agreed upon 
(Guillen et al., 2005; Sainz et al., 2005). Even for MHV S and other 
cleaved S proteins, the fusion peptide is not the amino terminus of S2 
created by cleavage (Luo and Weiss, 1998), as is the case in other type I 
fusion proteins. 

The receptor-mediated conformational change in S1 and the dissoci- 
ation of S1 from S2 are thought to initiate a major rearrangement in 
the remaining S2 trimer. This rearrangement exposes a fusion peptide 
that interacts with the host cellular membrane, and it brings together 
the two heptad repeats in each monomer so as to form an antiparallel, 
six-helix “trimer-of-dimers” bundle. The result is the juxtaposition of 
the viral and cellular membranes in sufficient proximity to allow the 
mixing of their lipid bilayers and the delivery of the contents of 
the virion into the cytoplasm. The trimer of dimers is extremely stable, 
forming a rod-like, protease-resistant complex, the biophysical proper- 
ties of which have been studied in depth for the S proteins of 
MHV (Bosch et al., 2003) and SARS-CoV (Bosch et al., 2003, 2004; 
Ingallinella et al., 2004; Liu et al., 2004; Tripet et al., 2004) by the use 
of model peptides. X-ray crystallographic structures have been solved 
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for peptide complexes for both the MHV S protein (Xu et al., 2004a) and 
the SARS-CoV S protein (Duquerroy et al., 2005; Supekar et al., 2004; 
Xu et al., 2004b). In the six-helix bundle, the three HR1 helices were 
found to form a central, coiled-coil core, and the three HR2 helices, in 
an antiparallel orientation, pack into the grooves between the HR1 
monomers. There is no contact between the HR2 monomers, each of 
which associates with the HR1 grooves through hydrophobic interac- 
tions. The overall structures obtained for MHV S and SARS-CoV S are 
highly similar to each other and strongly resemble the structures of 
the fusion cores of influenza virus HA and HIV gp41. Noteworthy 
differences are that the coronavirus HR1 coiled-coil is two to three 
times larger than its counterparts in other viruses and that the much 
shorter coronavirus HR2 helices assume a unique conformation within 
the bundle. A major goal of these studies is the design of peptides that 
are able to inhibit formation of this complex in SARS-CoV infections. 

In addition to the mechanisms of the conformational rearrange- 
ments of S1 and S2, other factors influence coronavirus fusion and 
entry, in ways that are not yet well understood. For two coronaviruses, 
the role of cholesterol in virus entry has been investigated. Cholesterol 
supplementation was found to augment MHV replication, while cho- 
lesterol depletion was inhibitory; these effects were shown to occur at 
the earliest stages of infection (Thorp and Gallagher, 2004). Contrary 
to expectations, the basis for the action of cholesterol was not through 
clustering of CEACAM receptors into lipid rafts, either before or after 
the binding of virus to receptor (Choi et al., 2005; Thorp and Gallagher, 
2004). However, cell-bound virions did cluster into lipid rafts, suggest- 
ing that MHV S protein associates with some host factor other than 
CEACAM prior to entry (Choi et al., 2005). For HCoV-229E, on the 
other hand, both virus and hAPN receptor were found to redistribute 
on the cell surface from an initially disperse pattern to clusters within 
caveolin-1-rich lipid rafts (Nomura et al., 2004). Thus, the mechanism 
by which cholesterol assists infection may differ between corona- 
viruses that enter the cell via receptor-mediated endocytosis and those 
that fuse with the plasma membrane. 

For those coronaviruses that bring about syncytia formation, cell— 
cell fusion appears to have different requirements than virus—cell 
fusion. Studies with MHV have long noted a correlation between the 
degree of S protein cleavage and the amount of cell-cell fusion, both of 
which could be enhanced by trypsin treatment (Sturman et al., 1985). 
The extent and kinetics of S protein cleavage were shown to vary 
among different cell lines, implicating the involvement of a cellular, 
rather than viral, protease (Frana et al., 1985). Consistent with this, 
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an MHV strain A59 mutant isolated from persistently infected glial 
cells was found to have an altered cleavage site, RRADR instead of 
the wild-type RRAHR (Gombold et al., 1993); this change caused an 
extreme delay, but not abrogation, of fusion of infected cells. Studies of 
expressed MHV S proteins with wild-type or mutated cleavage sites 
gave essentially the same results, showing that the fusion delay was 
strictly a property of mutant S protein (Bos et al., 1995; Stauber et al., 
1993; Taguchi, 1993). However, S protein was found not to be cleaved 
at all in MHV-infected primary glial cells or hepatocytes, indicating 
that cleavage was not a requirement for virus—cell fusion (Hingley 
et al., 1998). It was demonstrated that furin or a furin-like protease 
is responsible for MHV S cleavage in tissue culture (de Haan et al., 
2004). Treatment of cells with a specific furin inhibitor blocked both 
cleavage and cell-cell fusion, but it had no effect on virus—cell fusion. 
Another component of the MHV S protein that operates in cell-cell 
fusion is the cysteine-rich region of the endodomain, mutation of which 
delays or abrogates syncytia formation (Bos et al., 1995; Chang et al., 
2000). It is currently not known how this segment of the S molecule, 
which is on the opposite side of the membrane from the six-helix 
bundle, participates in the fusion process. The cysteine-rich region of 
the endodomain is a possible target for palmitoylation (Bos et al., 
1995), which is a known modification of MHV S (Niemann and Klenk, 
1981), but, as yet, a role for palmitoylation has not been established. 


B. Virion Assembly Interactions 


Once the full program of viral gene expression is underway, through 
transcription, translation, and genome replication, progeny viruses 
can begin to assemble. Coronavirus virion assembly occurs through a 
series of cooperative interactions that occur in the ER and the ERGIC 
among the canonical set of structural proteins, S, M, E, and N. The M 
protein is a party to most, if not all, of these interactions and has come 
to be recognized as the central organizer of the assembly process. 
Despite its dominant role, however, M protein alone is not sufficient 
for virion formation. Independent expression of M protein does not 
result in its assembly into virion-like structures. Under these circum- 
stances, M was shown to traverse the secretory pathway as far as the 
trans-Golgi (Klumperman et al., 1994; Machamer and Rose, 1987; 
Machamer et al., 1990; Rottier and Rose, 1987; Swift and Machamer, 
1991), where it forms large, detergent-insoluble complexes (Krijnse 
Locker et al., 1995; Weisz et al., 1993). By contrast, MHV, IBV, TGEV, 
and FIPV, representative species from each of the three coronavirus 
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groups, were found to bud into a proximal compartment, the ERGIC 
(Klumperman et al., 1994; Krijnse Locker et al., 1994; Tooze et al., 
1984, 1988). These observations suggested that some factor, in addition 
to M, must determine the site of virion assembly and budding. 

The identification of the unknown factor came from the development 
of virus-like particle (VLP) systems for coronaviruses. Such studies 
showed that, for MHV, coexpression of both M protein and the minor 
virion component, E protein, was necessary and sufficient for the 
formation of particles (Bos et al., 1996; Vennema et al., 1996). The 
resulting VLPs were morphologically identical to virions (minus 
spikes) and were released from cells by a pathway similar to that used 
by virions. Notably, neither the S protein nor the nucleocapsid was 
found to be required for VLP formation. These results were subse- 
quently generalized for coronaviruses from all three groups: BCoV 
and TGEV (Baudoux et al., 1998), IBV (Corse and Machamer, 2000, 
2003), and SARS-CoV (Mortola and Roy, 2004). Currently, there is one 
known exception to this trend: in a separate study of SARS-CoV, M and 
N proteins were reported to be necessary and sufficient for VLP forma- 
tion, whereas E protein was dispensable (Huang et al., 2004a). This 
latter contradiction remains to be resolved. It may reflect a unique 
aspect of SARS-CoV virion assembly, or, alternatively, it may indicate 
that VLP requirements can vary with different expression systems. 


1. M Protein—M Protein Interactions 


Since VLPs contain very little E protein, it is assumed that lateral 
interactions between M protein monomers are the driving force for 
virion envelope formation. These interactions have been explored 
through examination of the ability of constructed M protein mutants 
to support or to interfere with VLP formation. A study that tested the 
structural requirements of the M protein found that mutations either 
in the ectodomain, or in any of the three transmembrane domains, or 
in the carboxy-terminal endodomain, could inhibit or abolish VLP 
formation (de Haan et al., 1998a). In particular, the carboxy terminus 
of M was extremely sensitive to small deletions or even to point muta- 
tions of the final residue of the molecule. Construction of many of these 
latter mutations in the viral genome revealed a consistent set of effects 
on viral viability. Yet, virions were better able than VLPs to tolerate 
carboxy-terminal alterations in M protein, presumably because virions 
were stabilized by additional intermolecular interactions not present 
in VLPs. In experiments in which both wild-type and mutant M pro- 
teins were coexpressed with E protein, wild-type M protein was able to 
rescue low concentrations of assembly-defective mutant M proteins 
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into VLPs (de Haan et al., 1998a). This finding, coupled with results 
from coimmunoprecipitation analyses, provided the basis for further 
work, which concluded that monomers of M interact via multiple con- 
tacts throughout the molecule and particularly in the transmembrane 
domains (de Haan et al., 2000). 


2. S Protein—M Protein Interactions 


That VLPs could be formed in the absence of S protein (Bos et al., 
1996; Vennema et al., 1996) confirmed the much earlier discovery that 
treatment of MHV-infected cells with the glycosylation inhibitor tuni- 
camycin led to the assembly and release of spikeless (and consequently, 
noninfectious) virions (Holmes et al., 1981; Rottier et al., 1981). These 
findings were also consistent with the properties of certain classical 
temperature-sensitive mutants of MHV and IBV, which, owing to S 
gene lesions, failed to incorporate spikes into virions at the nonpermis- 
sive temperature (Luytjes et al., 1997; Ricard et al., 1995; Shen et al., 
2004). Independently expressed MHV, FIPV, or IBV S proteins enter 
the default secretory pathway and ultimately reach the plasma mem- 
brane (Vennema et al., 1990). In the presence of M protein, however, a 
major fraction of S is retained in intracellular membranes, as was 
shown by coimmunoprecipitation of S and M proteins from MHV- 
infected cells (Opstelten et al., 1995). Moreover, the interaction of 
M with S was demonstrated to be specific; complexes of M did not 
impede the progress of a heterologous glycoprotein (the VSV G protein) 
to the plasma membrane. Additionally, kinetic experiments revealed 
that the folding and oligomerization of S protein in the ER is rate 
limiting in the M-S interaction, in which nascent M protein immedi- 
ately participates (Opstelten et al., 1995). Complexes of the M and S 
proteins were similarly observed in BCoV-infected cells, for which it was 
found that M also determines the selection of HE protein for incorpora- 
tion into virions (Nguyen and Hogue, 1997). The simplest picture to 
be drawn from all this evidence, then, is that S protein is entirely 
passive in assembly but becomes trapped by M protein upon passage 
through the ER. Nevertheless, there are indications that, in some cases, 
S cooperates in its own capture. By the criterion of acquisition of endo 
H resistance, independently expressed S protein was found to be trans- 
ported to the cell surface with much slower kinetics than S protein 
that was incorporated into virions. This led to the proposal that free 
S protein harbors intracellular retention signals that become hidden 
during virion assembly (Vennema et al., 1990). Such signals have been 
found in the (group 3) IBV S protein cytoplasmic endodomain, which 
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contains both a dilysine motif that was shown to specify retention in the 
ERGIC and a tyrosine-based motif that causes retrieval by endocytosis 
from the plasma membrane (Lontok et al., 2004). Additionally, a novel 
dibasic ERGIC retention signal was identified in the S protein endodo- 
mains of group 1 coronaviruses (TGEV, FIPV, and HCoV-229E) and 
SARS-CoV, but not other group 2 coronaviruses, such as MHV and 
BCoV. 

Although the S protein is not required for VLP formation, it does 
become incorporated into VLPs if it is coexpressed with the M and 
E proteins (Bos et al., 1996; Vennema et al., 1996). VLP manipulations 
thus made it possible to begin to dissect the molecular basis for the 
specific selection of S protein by M protein. As for M—M homotypic 
interactions, the sites within M protein that bind to S protein have not 
yet been pinpointed. On a broader scale, deletion mapping has indi- 
cated that the ectodomain of M protein and the carboxy-terminal 
25 residues of the endodomain do not participate in interactions with 
S, even though both of these regions are critical for VLP formation 
(de Haan et al., 1999). The residues of S protein that interact with 
M protein, on the other hand, have been much more precisely localized. 
This mapping began with the swapping of ectodomains between the 
very divergent S proteins of MHV and FIPV (Godeke et al., 2000). This 
type of exchange showed that the incorporation of S protein into VLPs 
of a given species was determined by the presence of merely the 
transmembrane domain and endodomain of S protein from the same 
species. The source of the S ectodomain did not matter. The assembly 
competence of the 1324-residue MHV S protein or the 1452-residue 
FIPV S protein was therefore restricted to just the 61-amino-acid, 
carboxy-terminal region of each of these molecules. That the domain- 
switched S molecules were completely functional was demonstrated by 
the construction of an MHV mutant, designated fMHV, in which the 
ectodomain of the MHV S protein was replaced by that of the FIPV S 
protein (Kuo et al., 2000). As predicted, this mutant gained the ability 
to grow in feline cells, while losing the ability to grow in mouse cells. 
The f{MHV chimera provided the basis for powerful selections, based on 
host cell species restriction, that have been used with the reverse 
genetic system of targeted RNA recombination (Section VI) (Kuo and 
Masters, 2002; Masters, 1999; Masters and Rottier, 2005). The con- 
verse construct, an FIPV mutant designated mFIPV, in which the 
ectodomain of the FIPV S protein was replaced by that of the MHV 
S protein, had properties exactly complementary to those of {MHV 
(Haijema et al., 2003). 
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More detailed dissection of the transmembrane domain and endodo- 
main of the MHV S protein has been carried out to further localize the 
determinants of S incorporation into virions (Bosch et al., 2005; Ye etal., 
2004). In one study, the S protein transmembrane domain, or the en- 
dodomain, or both, were swapped with the corresponding region(s) of a 
heterologous transmembrane protein, which was expressed as an extra 
viral gene product (Ye et al., 2004). Mutations were constructed in this 
surrogate virion structural protein, or, alternatively, directly in the S 
protein. From this work, the virion assembly property of S was found to 
map solely to the 38-residue endodomain, with a major role assigned to 
the charge-rich, carboxy-terminal region of the endodomain. Additional- 
ly, it was observed that the adjacent, membrane-proximal, cysteine-rich 
region of the endodomain was critical for cell-cell fusion during infection, 
consistent with results previously reported from investigations using 
S protein expression systems (Bos et al., 1995; Chang et al., 2000). A 
second study, based on analysis of a progressive series of carboxy- 
terminal truncations of the S protein in VLPs and in viral mutants, also 
mapped the virion assembly competence of S to the endodomain (Bosch 
et al., 2005). In this work, however, the major role in assembly was 
attributed to the cysteine-rich region of the endodomain, and the overall 
size, rather than the sequence of the endodomain, was seen to be critical. 
Thus, the precise nature of the interaction between the S protein endo- 
domain and the M protein remains to be resolved. 


3. N Protein—M Protein Interactions 


The interaction of the viral nucleocapsid with M protein was origi- 
nally examined by the fractionation of purified MHV virions (Sturman 
et al., 1980). At 4°C, M protein was separated from other components 
on density gradient centrifugation of NP-40-solubilized virion prepara- 
tions, but M reassociated with the nucleocapsid when the temperature 
was elevated to 37°C. Further analysis suggested that, contrary to 
expectations, this temperature-dependent association was mediated 
by M binding to viral RNA, rather than to N protein. The notion of 
M protein as an RNA-binding protein has been revived in light of 
recent results on the mechanism of genome packaging (Section IV.C) 
(Narayanan et al., 2008a). 

For TGEV virions, the use of particular low-ionic-strength condi- 
tions of NP-40 treatment similarly resulted in the finding that a frac- 
tion of M protein was persistently integrated with subviral cores (Risco 
et al., 1996). For assay of this association, in vitro-translated M protein 
was bound to immobilized nucleocapsid purified from virions (Escors 
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et al., 2001). Through the combined approaches of deletion mapping, 
inhibition by antibodies of defined specificity, and peptide competition, 
the M-nucleocapsid interaction was localized to a segment of 16 resi- 
dues adjacent to the carboxy terminus of the 262-residue TGEV 
M protein. 

Studies of MHV have taken genetic avenues to explore the N 
protein—M protein interaction. In one report, a viral mutant was con- 
structed in which the carboxy-terminal two amino acids of the 228- 
residue MHV M protein were deleted (Kuo and Masters, 2002), a lesion 
previously known to abolish VLP formation (de Haan et al., 1998a). 
The resulting highly impaired virus, designated MA2, formed tiny 
plaques and grew to maximal titers many orders of magnitude lower 
than those of the wild type. Multiple independent second-site rever- 
tants of the MA2 mutant were isolated and mapped to either the 
carboxy terminus of M or that of N. Reconstruction of some of these 
compensating mutations, in the presence of the original MA2 muta- 
tion, provided evidence for a structural interaction between the car- 
boxy termini of the M and the N proteins. In a complementary 
analysis, a set of viral mutants were created containing all possible 
clustered charged-to-alanine mutations in the carboxy-terminal do- 
main 3 of the N protein (Hurst et al., 2005). One of the members of 
this set, designated N-CCA4, was extremely defective, having a phe- 
notype similar to that of the MA2 mutant. Multiple independent 
second-site suppressors of N-CCA4 were found to map in the 
carboxy-terminal region of either the N or the M protein, thereby 
reciprocating the genetic cross-talk uncovered with the MA2 mutant. 
Additionally, it was shown that the transfer of N protein domain 3 to a 
heterologous protein allowed incorporation of that protein into MHV 
virions. 


4. Role of E Protein 


In contrast to the more overt structural roles of the M, S, and N 
proteins, the part played by E protein in assembly is enigmatic. On 
discovery of the essential nature of E in VLP formation, it was specu- 
lated that the low amount of E protein in virions and VLPs indicated a 
catalytic, rather than structural, function for this factor. E protein 
might serve to induce membrane curvature in the ERGIC, or it might 
act to pinch off the neck of the viral particle in the final stage of the 
budding process (Vennema et al., 1996). In a search for evidence corre- 
lating the VLP findings to the situation in whole virions, a set of 
clustered charged-to-alanine mutations were constructed in the E gene 
of MHV. One of the resulting mutants was markedly thermolabile, and 


234 PAUL S. MASTERS 


its assembled virions had striking morphologic defects, exhibiting 
pinched and elongated shapes that were rarely seen among wild-type 
virions (Fischer et al., 1998). This phenotype clearly supported a criti- 
cal role for E protein in virion assembly. Surprisingly, however, it was 
later found to be possible to entirely delete the E gene from the MHV 
genome, although the resulting AE mutant virus was only minimally 
viable compared to the wild type (Kuo and Masters, 2003). This in- 
dicated that, for MHV, the E protein is important, but not absolutely 
essential, to virion assembly. By contrast, for TGEV, two independent 
reverse genetic studies showed that knockout of the E gene was lethal. 
Viable virus could be recovered only if E protein was provided in trans 
(Curtis et al., 2002; Ortego et al., 2002). This discordance may point to 
basic morphogenic differences between group 2 coronaviruses (such 
as MHV) and group 1 coronaviruses (such as TGEV). Alternatively, it 
is possible that E protein has multiple activities, one of which is 
essential for group 1 coronaviruses but is largely dispensable for group 
2 coronaviruses. 

The information available about E protein at this time is not suffi- 
ciently complete to allow us to understand the function of this tiny 
molecule. One of the most intriguing questions is whether it is neces- 
sary for E protein to directly physically interact with M protein, or 
whether E acts at a distance. If E protein has multiple roles, then 
perhaps both of these possibilities are applicable. Direct interaction 
between the E and M proteins is implied by the observation that, at 
least in some cases, coexpression of E and M proteins from different 
species does not support VLP formation (Baudoux et al., 1998). The 
demonstration that IBV E and M can be cross-linked to one another 
also has established that the two proteins are in close physical proxim- 
ity in infected or transfected cells (Corse and Machamer, 2003). 
Contrary to this, some data appear to argue that E acts independently 
of M. The individual expression of MHV or IBV E protein results in 
membrane vesicles that are exported from cells (Corse and Machamer, 
2000; Maeda et al., 1999). Additionally, it has been shown that the 
expression of MHV E protein alone leads to the formation of clusters of 
convoluted membranous structures highly similar to those seen in 
coronavirus-infected cells (Raamsman et al., 2000). This suggests that 
the E protein, without other viral proteins, acts to induce membrane 
curvature in the ERGIC. Some indirect evidence may also be taken to 
indicate that E does not directly contact other viral proteins. Multiple 
revertant searches with E gene mutants failed to identify any suppres- 
sor mutations that map in M or in any gene other than E (Fischer 
et al., 1998). Similarly, none of the intergenic suppressors of the MA2 
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mutant mapped to the E gene (Kuo and Masters, 2002). It has been 
found that the SARS-CoV E protein forms cation-selective ion channels 
in a model membrane system (Wilson et al., 2004). Moreover, this 
channel-forming property was contained in the amino-terminal 40 
residues of the 76-residue SARS-CoV E molecule. Such an activity 
made be the basis for an independent mode of action of E protein. 


C. Genome Packaging 


Although a variety of positive- and negative-strand viral RNA spe- 
cies are synthesized during the course of infection (Section V), corona- 
viruses selectively incorporate genomic (positive-strand) RNA into 
assembled virions. This may be accomplished with varying degrees of 
stringency by different members of the family. Sucrose gradient- 
purified virions of MHV have been found to exclusively contain geno- 
mic RNA (Makino et al., 1990). By contrast, similarly purified virions 
of BCoV (Hofmann et al., 1990), TGEV (Sethna et al., 1989, 1991), and 
IBV (Zhao et al., 1993) have been reported to contain significant quan- 
tities of subgenomic mRNA, in some cases in molar amounts exceeding 
those of the genomic RNA. However, in a study of TGEV, in which 
virions were extensively purified by an ELISA-based immunopurifica- 
tion procedure, a very high degree of selectivity for genomic RNA 
packaging was observed (Escors et al., 2003). 

In those viruses in which it has been mapped, the RNA element that 
specifies selective packaging falls, as would be expected, in a region of 
the genome that is not found in any of the subgenomic mRNAs. In 
MHV, the genomic packaging signal was localized through analysis of 
defective interfering (DI) RNAs. DI RNAs are extensively deleted 
variants of the genome that propagate as molecular parasites, using 
the replicative machinery of a helper virus. Some DI RNAs are pack- 
aged efficiently, while others have lost such a capability. Dissection of 
particular members of the former class revealed that a relatively small 
span of internal sequence could account for packaging competence 
(Makino et al., 1990; van der Most et al., 1991). The exact boundaries 
of the MHV packaging signal are not precisely defined, but reports 
from different groups have converged on RNA segments of 180-190 nt, 
within a 220-nt region that is centered some 20.3 kb from the 5’ end of 
the genome (Fosmire et al., 1992; Molenkamp and Spaan, 1997). The 
MHV packaging element is thus embedded in the coding sequence of 
nsp15, at the distal end of the replicase gene. A core 69-nt RNA 
secondary structural element can act as a minimal signal for packag- 
ing (Fosmire et al., 1992; Woo et al., 1997), but larger versions of the 
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element, consisting of the core plus flanking sequences, function more 
efficiently (Cologna and Hogue, 2000; Narayanan and Makino, 2001). 
Even the larger versions of the element may not be entirely sufficient, 
however: some data suggest that other cis-acting sequences found 
in genomic, but not subgenomic, DI RNA contribute to the overall 
efficiency of packaging (Bos et al., 1997). 

For the closely related group 2 coronavirus BCoV, the 190-nt geno- 
mic region homologous to the MHV packaging signal has been shown 
to have the same function as its MHV counterpart. Moreover, the MHV 
and BCoV packaging signals are able to act in a reciprocal fashion: a 
nonviral RNA containing the MHV packaging signal can be packaged 
by BCoV helper virus, and a nonviral RNA containing the BCoV 
packaging signal can be packaged by MHV helper virus (Cologna and 
Hogue, 2000). This functional homology does not appear to extend 
across group boundaries, though. For the group 1 coronavirus TGEV, 
the packaging signal was also shown to be retained in particular DI 
RNAs, which were found to be incorporated into defective virions that 
could be separated from helper virus by density gradient centrifuga- 
tion (Mendez et al., 1996). Surprisingly, dissection of the smallest 
packaged DI RNA revealed that the packaging signal for TGEV maps 
to the upstream end of the replicase gene, localizing in the region of 
100—649 nt from the 5’ end of the genome (Escors et al., 2003). For the 
group 3 coronavirus IBV, a packaged DI RNA has been isolated and 
characterized (Penzes et al., 1994), but mapping of the packaging 
element in this RNA has thus far been inconclusive, owing to the need 
to decouple requirements for replication from those for packaging 
(Dalton et al., 2001). Nevertheless, it is clear that the IBV DI RNA 
does not harbor a region of the IBV genome homologous to the region 
that contains the packaging signal in MHV. Similarly, the IBV DI RNA 
may also lack the counterpart of the TGEV packaging signal. It will be 
interesting to see whether the packaging signals of viruses in the three 
coronavirus groups, once they are completely characterized, are found 
to retain structural similarities despite differences in sequence and 
location. 

The mechanism by which packaging signals operate is not yet clear, 
and results with MHV have in fact taken an unanticipated turn. In 
this context, it is important to note the distinction between encapsi- 
dation and packaging, two terms that are often used interchangeably 
in the coronavirus literature. Encapsidation is the process of formation 
of the nucleocapsid, that is, the cooperative binding of N protein to 
viral RNA. Packaging is the incorporation of the nucleocapsid into 
virions. For enveloped viruses, the two processes are not necessarily 


THE MOLECULAR BIOLOGY OF CORONAVIRUSES 237 


the same. For example, for nonsegmented negative-strand viruses, 
both genomic and antigenomic RNA are encapsidated, but only geno- 
mic RNA is packaged. For coronaviruses, it was logical to assume that 
encapsidation is initiated by the N protein. Indeed, specific binding of 
MHV N protein to the packaging signal RNA has been demonstrated 
in vitro (Molenkamp and Spaan, 1997). However, in vitro RNA binding 
experiments have also shown a specific interaction between the 
MHV N protein and the leader RNA, which is located at the 5’ end 
of subgenomic and genomic RNA (Nelson et al., 2000; Stohlman 
et al., 1988). It remains to be seen whether either of these sequence- 
specific modes of RNA binding represents a nucleation step ultimately 
leading to encapsidation by multiple monomers of N. The binding of 
N to leader RNA appears incongruent with the specificity of packaging, 
but it is consistent with the observation that anti-N antibodies 
coimmunoprecipitate both subgenomic and genomic RNA from cells 
infected with MHV or BCoV (Baric et al., 1988; Cologna et al., 2000; 
Narayanan et al., 2000). A possible resolution of this paradox has come 
from findings that reveal a role for M protein in the selectivity of 
packaging. Antibodies to MHV M protein were shown to coimmunopre- 
cipitate the fraction of N protein that is bound to genomic RNA, but 
not N protein that is bound to subgenomic RNA (Narayanan et al., 
2000). Furthermore, this specific M-N interaction is dependent on 
the presence of the MHV packaging signal (Narayanan and Makino, 
2001). Remarkably, recent work with coexpressed MHV proteins has 
attributed the direct selection of packaging signal RNA to the M protein. 
Thus, VLPs formed by M and E proteins, but devoid of N protein, were 
found to incorporate a heterologous RNA molecule only if it contained the 
MHV packaging signal (Narayanan et al., 2003a). If this discovery turns 
out to generalize to all coronaviruses, then it will mean that M protein 
orchestrates every single interaction necessary for virion assembly. 


V. RNA SYNTHESIS 


A. Replication and Transcription 


Coronavirus RNA synthesis proceeds by a complex and incompletely 
understood mechanism, portions of which involve interactions be- 
tween distant segments of the genome (Lai and Cavanagh, 1997; Lai 
and Holmes, 2001; van der Most and Spaan, 1995). Following its 
translation into the replicase polyproteins, the genomic RNA (gRNA) 
next acts as the template for synthesis of negative-sense RNA species. 
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Further events produce a series of smaller, subgenomic RNAs 
(sgRNAs) of both polarities (Fig. 6) (Baric and Yount, 2000; Sethna 
et al., 1989, 1991). The positive-sense sgRNAs, each of which serves as 
the message for one of the ORFs downstream of the replicase ORF, 
have compositions equivalent to large genomic deletions. Positive- 
sense sgRNAs contain a 70—100-nt leader RNA, which is identical to 
the 5’ end of the genome, joined at a downstream site to a stretch of 
sequence (the body of the sgRNA), which is identical to the 3’ end of 
the genome. Collectively, the sgRNAs are said to form a 3/-nested set. 
The 3’-nested set of sgsRNAs, with or without a leader sequence, is a 
defining feature of the order Nidovirales (Enjuanes et al., 2000a; van 
Vliet et al., 2002). The negative-sense sgRNAs, roughly a tenth to a 
hundredth as abundant as their positive-sense counterparts, each 
possess the complement of this arrangement, including a 5’ oligo(U) 
tract of 9-26 residues (Hofmann and Brian, 1991) and a 3’ antileader 
(Sethna et al., 1991). 

Many advances in understanding the mechanism of coronavirus 
RNA synthesis were facilitated by the discovery and cloning of DI 
RNAs of MHV (Makino et al., 1985, 1988; van der Most et al., 1991) 
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and, subsequently, of other coronaviruses (Chang et al., 1994; Mendez 
et al., 1996; Penzes et al., 1994). Because they are extensively deleted 
genomic variants that propagate by competing for the viral RNA syn- 
thesis machinery, DI RNAs have evolved to retain cis-acting sequence 
elements necessary for replication. Manipulations of naturally occur- 
ring and artificially constructed DI RNAs, which are studied by trans- 
fection into infected cells, enabled the mapping of elements from the 
genome that participate in replication and transcription (Brian and 
Baric, 2005). 

In studies of replication, deletion analyses of various cloned MHV DI 
RNAs have demonstrated that either 466, 474, or 859 nucleotides at 
the 5’ end of the MHV genome are required to support replication (Kim 
et al., 1993; Lin and Lai, 1993; Luytjes et al., 1996). The exact magni- 
tude of this value appears to have been dependent on which MHV 
genomic regions were present in the individual DI RNA with which a 
particular analysis was begun. In the very closely related BCoV, 498 
nucleotides at the 5’ end of a naturally occurring DI RNA have been 
shown to suffice for replication (Chang et al., 1994). For TGEV and IBV, 
the minimal 5’ cis-acting replication signals have thus far been limited 
to 1348 and 544 nucleotides, respectively (Dalton et al., 2001; Izeta 
et al., 1999). In all cases, this region extends well beyond the leader 
RNA and includes a portion of the 5’ end of the replicase ORF. This 
means that coronavirus sgRNAs do not have a sufficient extent of 
5’ sequence to function as replicons, as was once proposed (Sethna 
et al., 1989). Only in BCoV has the 5’ cis-acting replication signal been 
further defined. Detailed dissections of this element, through structural 
probing and functional mutational analyses, have identified four stem- 
loop structures essential for RNA replication (Chang et al., 1994, 1996; 
Raman and Brian, 2005; Raman et al., 2003). For stems III and IV, 
secondary structure, rather than primary sequence, has been shown 
to be of functional importance; these structures were found to be con- 
served in the more closely related group 2 coronaviruses but not in 
SARS-CoV. 

At the other end of the genome, deletion analyses found that the 
minimal stretch of the 3’ terminus able to sustain MHV DI RNA 
replication falls between 436 and 462 nucleotides (Kim et al., 1993; 
Lin and Lai, 1993; van der Most et al., 1995). Notably, this range of 
sequence would include a portion of the adjacent N gene as well as the 
entire 301-nucleotide 3’ UTR. By contrast, the minimal 3’ cis-acting 
replication signals for TGEV and IBV were 492 and 338 nucleotides, 
respectively. DI RNAs containing such minimal elements were devoid 
of any part of the N gene (Dalton et al., 2001; Izeta et al., 1999). 
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Consistent with this latter finding, it was shown for engineered mu- 
tants of MHV that translocation of the N gene to an upstream genomic 
position had no effect on replication (Goebel e¢ al., 2004a). This argues 
strongly that no essential 3’ cis-acting region is present in the N gene 
within the intact MHV genome. If any such region does exist, it must 
be able to act at a distance of nearly 1.5 kb. Given the requirement in 
MHV for the entire 3’ UTR, it was somewhat paradoxical when further 
study showed that a minimum of 45-55 nucleotides at the 3’ end of 
the genome, plus an indeterminate amount of poly(A) tail, sufficed to 
support negative-strand RNA synthesis (Lin et al., 1994). From this 
result it was concluded that the promoter for negative-strand initia- 
tion lies completely within the last 55 nucleotides of the genome and 
that the remainder of the 3’ cis-acting element must be required for 
positive-strand RNA synthesis. Alternatively, the 3/-most 45-55 
nucleotides of the genome may constitute the minimal region able to 
associate in trans with helper virus genome so as to allow initiation of 
negative-strand synthesis. A finer examination of the 3’ poly(A) tail 
requirement found that, for both MHV and BCoV DI RNAs, no fewer 
than 5-10 A residues are necessary for replication, and there is a 
correlation between DI RNA replication competence and the ability 
to bind poly(A)-binding protein (Spagnolo and Hogue, 2000). 

Further investigation of the 3’ UTR in MHV and BCoV has produced 
a fairly complete picture of the RNA landscape of this region. At the 
upstream end of the 3’ UTR, two functionally essential structures have 
been demonstrated by chemical and enzymatic probing and by genetic 
studies with both DI RNAs and constructed viral mutants. The first 
structure is a bulged stem-loop (Hsue and Masters, 1997; Hsue et al., 
2000; Goebel et al., 2004a); the second is an adjacent RNA pseudoknot 
(Goebel et al., 2004a; Williams et al., 1999). An intriguing property of 
these upstream RNA elements is that they partially overlap, that is, 
the bulged stem-loop and the pseudoknot would not be able to fold up 
simultaneously. It has thus been proposed that they constitute compo- 
nents of a molecular switch that is operative at some stage of RNA 
synthesis, although a target of their putative regulation has not yet 
been identified (Goebel et al., 2004a). Further downstream in the MHV 
genome is a complex RNA secondary structural element that takes up 
most of the remainder of the 3’ UTR (Johnson et al., 2005; Liu et al., 
2001). Although this structure is only poorly conserved with the struc- 
ture predicted for the corresponding region of the BCoV 3’ UTR, muta- 
tions made in one stem that is highly conserved between the two 
viruses were found to be deleterious to DI RNA replication. Surpris- 
ingly, in the heart of this most divergent region of the 3’ UTR is found 
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an octanucleotide motif, 5'-GGAAGAGC-3’, that is absolutely conserved 
in the 3’ UTRs of all coronaviruses in all three groups. 

The presence of the 3’ UTR stem-loop and pseudoknot appears to 
be a distinguishing feature of the group 2 coronaviruses. The group 1 
coronaviruses all contain a highly conserved pseudoknot (Williams 
et al., 1999), but no detectable counterpart of the bulged stem-loop in 
either upstream or downstream proximity to it. On the other hand, the 
group 3 coronaviruses have a highly conserved and functionally essen- 
tial stem-loop (Dalton et al., 2001), but merely a poor candidate for the 
pseudoknot structure can be found nearby (Williams et al., 1999). Only 
the group 2 coronaviruses have both elements, and, in all cases, the 
elements overlap in the same fashion. Despite sequence divergence 
among the 3’ UTRs of group 2 coronaviruses, these genomic segments 
are functionally equivalent. The BCoV 3’ UTR was found to be able to 
entirely replace the MHV 3’ UTR (Hsue and Masters, 1997). Moreover, 
it was demonstrated that replication of a BCoV DI RNA could be 
supported by any of a number of closely related group 2 helper viruses, 
including MHV (Wu et al., 2003). More strikingly yet, the SARS-CoV 3’ 
UTR was found to be able to entirely replace the MHV 3’ UTR (Goebel 
et al., 2004b). Thus, the replicase machinery of a group 2 coronavirus, 
MHYV, is able to recognize and use the 3’ cis-acting structures and 
sequences of other group 2 coronaviruses, BCoV and SARS-CoV. By 
contrast, the MHV 3’ UTR cannot be replaced with either the group 1 
TGEV 3’ UTR or the group 3 IBV 3’ UTR. 

Numerous investigations have focused on the intriguing nature of 
coronavirus sgRNA transcription. The sites of leader-to-body fusion in 
the sgRNAs occur at loci in the genome that contain a short run of 
sequence that is identical, or nearly identical, to the 3’ end of the leader 
RNA (Fig. 6). These sites are called transcription-regulating sequences 
(TRSs); they have also been designated transcription-associated se- 
quences (TASs) or intergenic sequences (IGs or IGSs). TRSs are fairly 
well conserved within each coronavirus group. The core consensus TRS 
is 5/-AACUAAAC-3’ for group 1; 5/-AAUCUAAAC-3’ for group 2 (except 
for SARS-CoV, for which it is 5’-AAACGAAC-3’); and 5/-CUUAACAA-3’ 
for group 3 (Thiel et al., 2003a; van der Most and Spaan, 1995). 
Not every TRS in a given virus conforms exactly to the consensus 
sequence; a number of allowable variant bases are found in individual 
TRSs. 

It was clear from very early studies that the sgRNAs are formed by a 
discontinuous, cotranscriptional process and that they are not pro- 
duced by splicing of a full-length genomic precursor (Jacobs et al., 
1981; Stern and Sefton, 1982). As for RNA replication, the first 
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systematic means of addressing the mechanism of transcription came 
from the manipulation of engineered DI RNAs. The efficiency of fusion 
at a given TRS was at first thought to be mediated solely by base-pairing 
between the 3’ end of the leader and the complement of the TRS. 
However, studies with DI RNAs containing authentic and mutated 
TRSs led many investigators to conclude that, beyond a minimum 
threshold of potential base pairing, other factors must predominate 
(Hiscox et al., 1995; Makino et al., 1991; van der Most et al., 1994). DI 
RNA studies thus provided the first indication of the importance of the 
local sequence context of the TRS and the position of the TRS relative to 
the 3’ end of the genome (Joo and Makino, 1995; Krishnan et al., 1996; 
Ozdarendeli et al., 2001; van Marle et al., 1995). 

The original conceptual framework for many studies was that of 
leader-primed transcription. In this model, sgRNAs were envisioned 
to be generated during positive-strand RNA synthesis. It was proposed 
that the polymerase pauses near the end of the leader sequence and 
detaches with the nascent free leader RNA. This step is followed by 
reattachment of the leader RNA to the complement of a TRS at an 
internal portion of the negative-strand template, from where the na- 
scent RNA is then elongated (Lai, 1986). A refinement of this idea was 
that leader-to-body fusion results from quasi-continuous synthesis 
across two distant portions of a looped-out template, which are brought 
together via protein-RNA and protein-protein interactions (Lai et al., 
1994; Zhang et al., 1994). 

More recently, accumulated experimental results, while retaining 
the notion of a looped-out template, have been taken to support a 
mechanism in which the discontinuous step in sgRNA synthesis occurs 
during negative-strand RNA synthesis (Fig. 7) (Sawicki and Sawicki, 
1998, 2005). In this model, the viral polymerase, starting from the 3’ 
end of a genomic template, switches templates at an internal TRS and 
resumes synthesis at the homologous TRS sequence at the 3’ end of 
the genomic leader RNA. The resulting negative-strand sgRNA, in 
association with positive-strand gRNA, then serves as the template 
for synthesis of multiple copies of the corresponding positive-strand 
sgRNA. This new view originated with the discovery of negative- 
strand sgRNAs (Sethna et al., 1989) and with the demonstration that 
free leader RNA could not be detected in infected cells (Chang et al., 
1994). Most (Baric and Yount, 2000; Sawicki and Sawicki, 1990; 
Sawicki et al., 2001; Schaad and Baric, 1994), although not all (An 
and Makino, 1998; An et al., 1998; Mizutani et al., 2000), subsequent 
biochemical work supported the contention that the negative-strand 
sgRNA species are kinetically competent to serve as templates for 
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Fic 7. Model for discontinuous negative-strand transcription. Negative-strand 
sgRNAs are initiated at the 3’ end of the gRNA template. Elongation proceeds as far as 
a body copy of a transcription-regulating sequence (TRS). A strand-switching event then 
occurs, pairing the newly transcribed negative-sense body TRS with the leader copy 
of the TRS, from which point transcription resumes. A complex of the (+)gRNA and 
the (—)sgRNA then serves as the template for synthesis of multiple (+)sgRNAs. 
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positive-strand sgRNAs. In addition, some of the strongest evidence for 
negative-strand discontinuous sgRNA synthesis came from landmark 
studies using a full-length infectious cDNA of equine arterivirus, the 
prototype member of the closely related arterivirus family. This work 
made use of a robust system in which both the leader copy and one or 
multiple body copies of the TRS were singly or simultaneously mutated 
in the genome; RNA synthesis in this system was able to be assayed in 
the initial passage of infectious RNA (Pasternak et al., 2001, 2003, 
2004; van Marle et al., 1999). The arterivirus results have been corro- 
borated, in part, by experiments enabled by the development of reverse 
genetic approaches for TGEV and MHV (Alonso et al., 2002; Curtis 
et al., 2004; de Haan et al., 2002a,b; Sola et al., 2005; Zuniga et al., 
2004). At this time, there is a broad, but not universal, consensus that 
for coronaviruses, as well as for other nidoviruses, both replication and 
transcription initiate with negative-strand RNA synthesis. However, 
much further work needs to be done to elucidate the details of the 
template-switching step of discontinuous transcription. It will also be 
necessary to extend to the coronaviruses principles that have been 
more clearly established for the arteriviruses. 


B. RNA Recombination 


An important feature of coronavirus RNA synthesis is the high rate 
of homologous and nonhomologous RNA-RNA recombination that has 
been demonstrated to occur among selected and unselected markers 
during the course of infection. Although most experimental work in 
this area has been performed with MHV (Keck et al., 1987, 1988a,b; 
Makino et al., 1986, 1987), a high frequency of homologous recombina- 
tion is clearly an attribute of the entire coronavirus family, given 
that it has been observed in other viruses in all three groups: TGEV 
(Sanchez et al., 1999), FIPV (Haijema et al., 2003; Herrewegh et al., 
1998), BCV (Chang et al., 1996), and IBV (Cavanagh et al., 1992; 
Kottier et al., 1995; Kusters et al., 1990; Wang et al., 1993). In addition, 
nonhomologous recombination was likely, in all three groups, to be the 
mechanism of acquisition of the various accessory protein genes. 

RNA recombination is thought to result from a copy-choice mecha- 
nism, as originally described for poliovirus (Kirkegaard and Baltimore, 
1986). In this scheme, the viral polymerase, with its nascent RNA 
strand intact, detaches from one template and resumes elongation at 
the identical position, or a similar position, on another template. In 
MHV, recombination has been shown to take place along the entire 
length of the genome at an estimated frequency of 1% per 1.3 kb 
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(almost 25% over the entire genome), the highest rate observed for any 
RNA virus (Baric et al., 1990). On a fine scale, the sites of recombina- 
tion were seen to be random (Banner and Lai, 1991), although strong 
selective pressures were able to create the appearance of local cluster- 
ing of recombinational hot spots in one study (Banner et al., 1990). 
Some results suggest that the rate of recombination increases across 
the entire MHV genome, from 5’ to 3’ end (Fu and Baric, 1992, 1994). 
This gradient may result from homologous recombination between 
genomic and subgenomic RNAs, since the latter would provide a source 
of donor and acceptor templates that would become more numerous as 
a function of proximity to the 3’ end of the genome. 

Most evidence supports a model for viral RNA recombination having 
three mechanistic requirements (Lai, 1992). First, the RNA polymer- 
ase must pause during synthesis. This may be an intrinsic property of 
the enzyme, or it may result from the enzyme encountering a template 
secondary structure that exceeds a certain stability threshold. Second, 
a new template must be in physical proximity. Third, some property of 
the new template must allow the transfer of the nascent RNA strand 
and the resumption of RNA synthesis. Alternatively, strand transfer 
could result from a processive mechanism that does not require poly- 
merase dissociation (Jarvis and Kirkegaard, 1991). For poliovirus, 
classical experiments showed that RNA recombination occurs during 
negative-strand RNA synthesis (Kirkegaard and Baltimore, 1986), 
most likely because positive-strand acceptor templates far outnumber 
negative strands (Jarvis and Kirkegaard, 1992). The same is likely to 
be true for coronaviruses, since they, too, have a high ratio of positive- 
strand to negative-strand RNA (Sawicki and Sawicki, 1986, 1990; 
Sethna et al., 1989). Moreover, for MHV, most or all negative-strand 
RNA is found duplexed with positive-strand RNA (Lin et al., 1994; 
Sawicki and Sawicki, 1986). Thus, there may be a bias toward 
negative-strand recombination simply because positive-strand RNA 
is the most available (single-stranded) acceptor template. However, 
instances of coronavirus homologous recombination that occurred dur- 
ing positive-strand RNA synthesis have been documented (Liao and 
Lai, 1992). Also, work with extremely defective MHV mutants has 
shown that sufficiently strong selective pressures can reveal unusual 
nonhomologous rearrangements, including recombination between 
negative- and positive-strand RNA, which are likely to be constantly 
occurring at a low frequency during viral RNA synthesis. 

One form of nonhomologous recombination that occurs between 
genomic and subgenomic RNA has been hypothesized to result from 
the collapse of the transcription complex during negative-strand 
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discontinuous transcription (Kuo and Masters, 2002). Such a disrup- 
tion, followed by resumption of replicative antigenome synthesis, 
would leave a partial copy of the leader sequence embedded at an 
internal point in the genome, near the junction between two genes. 
This type of recombinant was selected repeatedly in revertants of a 
severely impaired MHV M protein mutant. However, similar tran- 
scriptional collapse events may have been a significant factor in coro- 
navirus evolution. Remnants of leader RNAs were found in the 
genomes of wild-type HCoV-OC43 (Mounir and Talbot, 1993) and in a 
mutant of MHV strain S (Taguchi et al., 1994). Most strikingly, the 
recently described HCoV-HKU1 genome contains two very significant 
segments of embedded leader sequence (Woo et al., 2005). Each of these 
leader remnants occurs at a site where there is an apparent deletion of 
an entire accessory gene, with respect to the genomic layouts of the 
closest relatives of this virus, MHV and BCoV. 


C. Replicase Complex 
1. Ribosomal Frameshifting 


The replicase complex that carries out the intricacies of viral RNA 
replication and transcription is encoded by the first gene of the corona- 
virus genome. This huge gene occupies roughly two-thirds of the 
genome and contains two ORFs, the complete expression of which is 
dependent on a programmed ribosomal frameshift. The discovery of 
coronavirus ribosomal frameshifting resulted from the completion 
of the sequence of IBV, the first member of the family for which an 
entire genomic sequence was obtained (Brierley et al., 1987). This 
revealed a small (43 nt) overlap between ORF 1a (11.9 kb) and ORF 
1b (8.1 kb), the latter in the —1 frame relative to the former; moreover, 
there was no sgRNA that could serve as the mRNA for ORF 1b. This 
arrangement was subsequently found to exist for all coronaviruses. 
Thus, ribosomal frameshifting, which had previously been seen only in 
retroviruses (Jacks et al., 1988), was proposed as a mechanism for 
expression of ORF 1b. Programmed frameshifting was demonstrated 
for the IBV gene Ja/16 overlap region in reporter gene constructs in 
experiments using in vitro translation systems and, in some cases, 
cellular expression systems (Brierley et al., 1989). In such systems, a 
frameshifting incidence of 25-30% was measured, representing an 
efficiency far greater than the 5% seen at the retroviral gag-pol junc- 
tion. It should be noted, however, that the efficiency of in vivo frame- 
shifting occurring in cells infected with IBV, or any other coronavirus, 
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has not yet been quantitated; nor is it known whether that value 
remains constant over the course of infection. 

IBV ribosomal frameshifting was found to depend on two genomic 
RNA elements (Fig. 8): a heptanucleotide “slippery sequence” 
(UUUAAAC) and a downstream, hairpin-type pseudoknot (Brierley 
et al., 1989). In addition, the spacing between these elements is 
critical. It is thought that the pseudoknot impedes the progress of 
the elongating ribosome. With some fixed probability, the delay re- 
quired for the ribosome to melt out this secondary structural element 
allows the simultaneous slippage of the P and A site tRNAs by one base 
in the —1 direction. Normal translational elongation then resumes. 
Studies of the kinetics of translation, using a model mRNA based on 
the IBV frameshifting region, support the idea of ribosomal pausing at 
the pseudoknot (Somogyi et al., 1993). Moreover, mutational studies of 
IBV frameshifting (Brierley et al., 1989) and direct mass spectrometric 
analysis of the SARS-CoV frameshifted polypeptide product (Baranov 
et al., 2005) have confirmed both the locus of the slippage site and the 
occurrence of simultaneous slippage. The reason why coronaviruses 
employ ribosomal frameshifting as a gene expression strategy is less 
well established at this time. The explanation most commonly given is 
that, as for retroviruses, the frameshifting mechanism provides a 
fixed ratio of translation products, in the necessary proximity of one 
another, for assembly into a macromolecular complex. It could also be 
speculated that frameshifting forestalls expression of the enzymatic 
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Fic 8. RNA elements required for ribosomal frameshifting. The expanded region 
shows RNA sequences and secondary structures that program the frameshift, using 
IBV as an example. 
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products of ORF 1b until a platform and a cellular environment for 
them have been prepared by the products of ORF 1a. 

The two genomic components required for ribosomal frameshifting 
have been investigated in considerable detail. Exhaustive mutagenesis 
of the slippery sequence showed that frameshifting could be facilitated 
by a number of heptameric sequences of the form XXXYYYN, where 
XXX and YYY are the postslippage P and A site codons, respectively 
(Brierley et al., 1992). Hierarchies of preferred combinations of X, Y, 
and N were defined, and these indicated a major role for the strength of 
the A-site tRNA interaction. However, although some heptanucleo- 
tides showed a frameshifting efficiency nearly as high as that of the 
wild type, it must be noted that, to date, all known coronaviruses have 
been found to contain a slippery sequence of UUUAAAC (Brian and 
Baric, 2005; Plant et al., 2005). 

The second component, the pseudoknot, has similarly been exam- 
ined through exhaustive mutagenesis (Brierley et al., 1991). Although 
the involvement of a downstream RNA secondary structural element 
in ribosomal frameshifting was first recognized with retroviruses 
(Jacks et al., 1988), the earliest demonstration that the requisite struc- 
ture is a pseudoknot came from the study of IBV (Brierley et al., 1989). 
This demonstration was initially by classic stem replacement muta- 
genesis, and, subsequently, by intensive modification of pseudoknot 
elements; all of the results of both types of studies supported the 
proposed structure. It was also revealed that the length of stem 1 is 
very important for frameshifting efficiency (Napthine et al., 1999) and 
that it is the structure, not the primary sequence, that is significant for 
both stems 1 and 2. Higher-order structure was also found to be 
critical: the pseudoknot could not be replaced by a single stem-loop of 
the same stability, containing the identical base pairs as the sum of the 
two pseudoknot stems (Brierley et al., 1991). 

The frameshifting signals of other coronaviruses have been found to 
generally conform to the rules defined for IBV, although additional 
complexities have emerged. With the completion of the genomic se- 
quences of the group 1 coronaviruses HCoV-229E (Herold and Siddell, 
1993) and TGEV (Eleouet et al., 1995), an “elaborated” pseudoknot was 
proposed for members of this group, containing a third stem falling 
within an unusually large loop 2. It is currently unresolved whether 
the group 1 elaborated pseudoknot is the operative structure in frame- 
shifting, as suggested by some mutational evidence (Herold and 
Siddell, 1993). By contrast, loop 2 can be assigned as for the other 
coronaviruses, with the extra group 1-specific element providing an 
alternative, long-range kissing loop interaction between the upstream 
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arm of pseudoknot stem 2 and the loop of a downstream stem-loop 
(Plant et al., 2005). Analysis of the sequence of the frameshifting 
region of the SARS-CoV genome led to the prediction of a third stem- 
loop within loop 2 of the pseudoknot (Ramos et al., 2004). This third 
element is situated differently from the additional stem of the group 1 
elaborated pseudoknot, but it is similar to the potential bulged stem- 
loop that was earlier proposed to reside in loop 2 of the pseudoknot of 
the torovirus Berne virus (Snijder et al., 1990). Further computational 
analysis has similarly found a possible third stem within loop 2 of the 
frameshifting pseudoknots of all coronaviruses, and the SARS-CoV 
stem 3 structure has been shown to be consistent with NMR data 
and nuclease mapping (Plant et al., 2005). The role of stem 3 in 
ribosomal frameshifting is, as yet, unclear. Contrary to the previous 
results in the IBV system, mutagenesis studies suggest that both the 
primary sequence and the structures of the SARS-CoV stems 2 and 3 
affect the efficiency of frameshifting (Baranov et al., 2005; Plant et al., 
2005). On the other hand, the complete deletion of stem 3 is not 
detrimental to frameshifting. This seeming discrepancy has led to 
the suggestion that stem 3 plays an as yet undiscovered regulatory 
role, perhaps in the switch from genome translation to replication 
(Plant et al., 2005). 


2. Replicase Proteins 


The end result of the ribosomal frameshifting-mediated translation 
of the replicase gene is the synthesis of two very large polyproteins, 
ppla and pplab. These range from 440 to 500 kDa and from 740 to 810 
kDa, respectively, and they are cotranslationally processed by two or 
three internally contained proteinase activities. The Herculean task of 
mapping all of the polyprotein processing events began at a time before 
investigators were even aware of the full sizes of coronavirus genomes 
(Denison and Perlman, 1986, 1987; Soe et al., 1987). Only relatively 
recently have replicase cleavage maps been completed for at least 
one representative from each coronavirus group (Bonilla et al., 1997; 
Kanjanahaluethai et al., 2003; Lim and Liu, 1998; Liu et al., 1998; Lu 
and Denison, 1997; Pinon et al., 1997; Schiller et al., 1998; Xu et al., 
2001; Ziebuhr and Siddell, 1999; Ziebuhr et al., 2001). Knowledge 
gained from these efforts allowed the informed prediction (Snijder 
et al., 2003; Thiel et al., 2003a) and rapid experimental verification 
(Harcourt et al., 2004; Prentice et al., 2004b) of the processing pathway 
for the SARS-CoV replicase. 

The final products of the autoproteolytic cleavage of ppla and 
pplab are 16 nonstructural proteins, designated nsp1—nsp16 (Fig. 9). 
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Fic 9. Protein products of the replicase gene. Cleavage sites and processed products of 
ppla (nsp1—nsp11) and of pplab (nsp1—nsp10, nsp12—nsp16) are shown. Predicted and/or 
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Nspl-nsp11 are derived from ppla, whereas nspl—nsp10 and 
nsp12—nsp16 are derived from pplab. Thus, all products processed 
from ppla are common to those processed from pplab, except for 
nsp11, which is an oligopeptide generated when ribosomal frameshift- 
ing does not occur. For IBV, which lacks a counterpart of nsp1, there 
are 15 final products of polyprotein cleavage. These are numbered 
beginning with nsp2, in order to maintain correspondence with their 
homologs in the other coronaviruses. Comparative layouts and proces- 
sing schemes for the replicase genes of all three coronavirus groups 
can be found in the review by Ziebuhr (2005) and references therein. 
Detailed lists and schematics of cleavage sites, the proteinases respon- 
sible, and the resulting nsp products for HCoV-229E, MHV, and IBV can 
be found in Table 2 and Figure 2 of the review by Ziebuhr et al. (2000). It 
should be noted that partial proteolytic products may also be significant 
in the processing scheme. The efficiency of cleavage at particular poly- 
protein sites may be regulated by both the exact primary sequence 
at the site and the site’s accessibility to the proteinase (Ziebuhr, 2005; 
Ziebuhr et al., 2000). 

Elucidation of the precise roles of nsp1—nsp16 will be the next major 
undertaking. Functions for many domains of the coronavirus replicase 
were predicted by pioneering bioinformatics methods well before the 
term “bioinformatics” was invented (Gorbalenya et al., 1989; Lee et al., 
1991). While knowledge about many of the replicase proteins is still at 
a very early stage, substantial progress has been made for others. 
Research in this field is proceeding at an unprecedented pace for 
reasons of both opportunity and necessity. First, tools that were not 
previously available, most notably reverse genetics systems for the 
replicase gene, are now at the disposal of coronavirus researchers. 
Second, the replicase products present a wide array of promising 
targets for anti-SARS therapeutics. The information that is currently 
at hand points to a correspondence between the genomic order of the 
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encoded activities of the replicase gene and the temporal program of 
infection. The products of ppla appear to function to prepare the cell 
for infection and to assemble the machinery for RNA synthesis. Then, 
the products that are unique to pplab carry out the actual catalysis of 
RNA replication and transcription. 

The very first mature translation product for MHV ppla, nsp1, has 
been shown to play a role in cell cycle arrest. It may thus prepare a 
favorable cellular environment for viral replication (Chen and Makino, 
2004; Chen et al., 2004). The next cleavage product, nsp2, diverges 
considerably among different coronaviruses, and no function for it has 
yet been predicted or demonstrated. Surprisingly, deletion of the com- 
plete nsp2 region from the genome of MHV or SARS-CoV was not 
lethal. However, nsp2 deletion mutants showed delayed viral growth 
kinetics (Graham et al., 2005). Other early replicase products are the 
enzymes that carry out the processing of the polyproteins: papain-like 
proteinases, which are in nsp3 (Baker et al., 1993), and the main 
proteinase, which is in nsp5 (Lu et al., 1995). Most coronaviruses have 
two papain-like proteinases, designated PL1P™ and PL2°"°. By con- 
trast, IBV and SARS-CoV have a single PLP™. PL1P™ and PL2°*° may 
have arisen by duplication, and in vitro, they appear to have some 
redundancy in their activities. However, for HCoV-229E, a genetic 
analysis showed that PL2P™ is essential, and the presence of both 
PL1?” and PL2?” was found to confer a clear advantage in viral fitness 
(Thiel and Siddell, 2005). In addition to the papain-like proteinases, 
nsp3 in many coronaviruses contains a domain that harbors ADP- 
ribose-1"-monophosphatase activity (Putics et al., 2005). The construc- 
tion of active-site mutants has shown that this activity is dispensable 
for replication of HCoV-229E in tissue culture. Although the cellular 
homolog of this enzyme plays a role in tRNA processing, the biological 
significance of the virally encoded activity is unknown. Nsp3 can also 
contain some variable domains. In HCoV-HKU1, as many as 14 tan- 
dem repeats of an acidic decapeptide are present in an amino-terminal 
segment of nsp3 (Woo et al., 2005 [note: nsp3 is misidentified as nsp1 in 
this reference]). In SARS-CoV, nsp3 contains a “SARS-unique” domain 
that is not found in any other coronavirus (Snijder et al., 2003). 

The coronavirus main proteinase, designated M?*°, constitutes all of 
nsp5. This enzyme has also been called the 3C-like proteinase (3CLP™°), 
because of its resemblance to the 3C proteinases of picornaviruses. 
Crystal structures have been solved for MP™ for HCoV-229E (Anand 
et al., 2002), TGEV (Anand et al., 2003), and SARS-CoV (Yang et al., 
2003). These reveal that M?” is a dimer, each monomer of which has a 
three-domain structure, with an active site located in a cleft between 
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the first and second domains in each monomer. At the carboxy termi- 
nus is an extra domain not found in the 3CL?”® of other viral families. 
Multiple structures determined for the SARS-CoV MP?" showed that 
the entire molecule undergoes major pH-dependent conformational 
changes, which have been proposed to regulate activity. 

At the carboxy-terminal end of ppla is a cluster of small proteins, 
nsp7—nsp10. The crystal structure of SARS-CoV nsp9 was solved inde- 
pendently by two groups (Egloff et al., 2004; Sutton et al., 2004). In 
addition, prompted by features of the structure, investigators found 
that nsp9 has nonspecific RNA-binding activity. Biophysical evidence 
has also been presented for an interaction between nsp9 and nsp8 
(Sutton et al., 2004). Therefore, although nsp9 was found to occur as 
a dimer in the crystals, its natural binding partner may be nsp8. A 
solution structure for SARS-CoV nsp7 was determined by NMR; this 
structure showed potential protein—protein interaction surfaces for 
this small polypeptide (Peti et al., 2005). Moreover, a cocrystal struc- 
ture of SARS-CoV nsp7 with nsp8 revealed a complex of eight mono- 
mers of each protein forming a hollow cylindrical structure. This 
hexadecameric assembly was proposed to be able to encircle an RNA 
template, possibly acting as a processivity factor for the RNA polymer- 
ase (Zhai et al., 2005). Thus, a picture of a putative complex of all four 
of the nsp7—nsp10 polypeptides is being gradually pieced together, but, 
as yet, there is a paucity of functional data to complement this wealth 
of structural information. 

Transmembrane domains in nsp3, nsp4, and nsp6 anchor the repli- 
case complex to intracellular membranes, and these proteins may be 
involved in the remodeling of the latter, to form double-membrane 
compartments that are dedicated to viral RNA synthesis (Bi et al., 
1999; Gosert et al., 2002; Prentice et al., 2004a; Shi et al., 1999; van 
der Meer et al., 1999). These double-membrane vesicles, which coloca- 
lize with nascent viral RNA, are distinct from the sites of virion 
assembly and budding. Coronavirus RNA synthesis may thus take 
place in structures that are similar to the autophagosomal RNA syn- 
thesis compartments that have been characterized in picornavirus- 
infected cells (Jackson et al., 2005). The nsp7—nsp10 products localize 
in discrete perinuclear and cytoplasmic foci in infected cells (Bost et al., 
2000), in a membrane-associated complex that also includes nsp2. This 
complex colocalizes with N protein and the viral helicase (nsp13) early 
in infection. However, late in infection, N protein and the helicase 
segregate into biochemically distinct membranes in the ERGIC that 
also contain M protein, suggesting a role for the helicase in genome 
encapsidation or packaging (Bost et al., 2001; Sims et al., 2000). 
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The postribosomal frameshift products of the replicase, nsp12— 
nsp16, contain the actual enzymes of RNA replication and transcrip- 
tion. The coronavirus RNA-dependent RNA polymerase (RdRp) is 
contained within nsp12, the first part of pplab synthesized after fra- 
meshifting. This protein has the fingers, palm, and thumb domains 
common to a number of viral RdRps and reverse transcriptases. In 
addition, the RdRp contains a very large, amino-terminal domain that 
is unique to the coronaviruses. For MHYV, the ability of the RdRp to 
associate with intracellular membranes was mapped to a 38-amino 
acid segment of the unique domain (Brockway et al., 2003). Membrane 
association of expressed RdRp also depended on MHV infection, indi- 
cating that other viral components are required for this targeting. In 
addition, the RdRp was shown to form intermolecular associations 
with MP”, nsp8, and nsp9. For the SARS-CoV RdRp, preliminary 
biochemical characterization of the bacterially expressed enzyme sug- 
gests that the coronavirus-unique domain is essential for activity 
(Cheng et al., 2005). 

Nsp13 contains multiple activities that have been extensively char- 
acterized for HCoV-229E and SARS-CoV (Ivanov and Ziebuhr, 2004; 
Ivanov et al., 2004a; Seybert et al., 2000). This protein is a helicase 
with a highly processive duplex unwinding activity for both DNA and 
RNA substrates. The nsp13 helicase unwinds with 5’—3’ polarity, sug- 
gesting that it has a role in preparing the template for the RdRp. 
Nsp13 also has RNA-dependent NTPase and dNTPase activities, 
which probably provide the energy for its translocation along RNA 
templates. In addition, nsp13 is a RNA 5’-triphosphatase, making it 
a candidate to carry out the initial step of RNA capping. 

Nsp14 and nsp15 have each been assigned ribonucleolytic functions. 
Such activities would, at first glance, seem to be out of place in an RNA 
virus. Nsp14 has been predicted to be an exonuclease (designated 
ExoN), which, it is speculated, could be involved in an RNA processing 
step integral to coronavirus transcription (Snijder et al., 2003). 
This activity has not yet been demonstrated, but a point mutation in 
nsp14 of MHV was shown to be markedly attenuating in the mouse 
host (Sperry et al., 2005). Nsp15 is an endoribonuclease, designated 
NendoU, that is found only in the nidoviruses (Snijder et al., 2003). 
This enzyme, from HCoV-229E and SARS-CoV, has been shown to 
hydrolyze both single- and double-stranded RNA, with a specificity 
for cleavage immediately upstream and downstream of uridylate resi- 
dues (Bhardwaj et al., 2004; Ivanov et al., 2004b). NendoU exhibited 
optimal activity with manganese ion, rather than magnesium ion, 
and it was essentially inactive with 2’-O-ribose-methylated RNA 
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substrates (Ivanov et al., 2004b). Mutation of the active site of nsp15 of 
HCoV-229E was found to be lethal. 

Finally, nsp16, the carboxy-terminal product of pplab, has been 
predicted to contain 2’-O-methyltransferase activity (Snijder et al., 
2003; von Grotthuss et al., 2003 [note: nsp16 is misidentified as nsp13 
in this reference]). Such an activity, which has not yet been demon- 
strated, would have a most obvious role in RNA capping. However, the 
possibility has been raised that 2/-O-methylation serves to protect a 
segment of duplex RNA from the NendoU activity of nsp15 in one stage 
of discontinuous negative-strand RNA synthesis (Ivanov et al., 2004b). 
Relevant to RNA capping, it must be noted that if coronaviruses 
possess their own guanylyltransferase or cap 7-methyltranferase activ- 
ities, these have not yet been identified among the many replicase 
proteins. 


3. Host Factors 


RNA viruses often expropriate and redirect host cell components, to 
assist in mechanisms of their own gene expression (Ahlquist et al., 
2003). A number of host factors have been proposed to participate in 
coronavirus RNA synthesis. To date, all of these have been discovered 
with either MHV or BCoV, and all were originally identified on the 
basis of their ability to bind in vitro to RNA segments of functional 
importance. The most completely characterized coronavirus host factor 
is heterogeneous nuclear ribonucleoprotein Al (hnRNP A1), which was 
initially found as a member of a set of proteins that bound to the 
negative strand of the MHV TRS (Furuya and Lai, 1998; Li et al., 
1997; Zhang and Lai, 1995). Its RNA-binding property, its affinity for 
MHV N protein, and its propensity to dimerize, all made hnRNP A1 
attractive as a potential mediator of the antigenome looping-out event 
envisaged by the leader-primed transcription model (Wang and Zhang, 
1999; Zhang and Lai, 1995; Zhang et al., 1999). Overexpression of 
hnRNP A1 was shown to result in a marked increase in the kinetics 
of MHV RNA synthesis, suggesting that this factor affects genome 
replication as well as transcription. Additionally, expression of a 
truncated form of hnRNP A1 had a dominant-negative effect on MHV 
replication (Shi et al., 2000). The role of hnRNP A1 was questioned on 
the basis of the finding that MHV replication and RNA synthesis were 
completely unimpaired in CB3 cells, a mutant murine cell line that 
does not express hnRNP A1 (Ben-David et al., 1992; Shen and Masters, 
2001). In addition, high-affinity hnRNP A1 binding sites (Burd and 
Dreyfuss, 1994), when placed in the MHV genome, did not act in lieu of 
a TRS and did not displace the site of leader-body fusion away from a 
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TRS (Shen and Masters, 2001). However, it was subsequently shown 
that other hnRNP A/B family members, which are present in CB3 cells, 
could replace hnRNP A1; further, overexpression of hnRNP A/B was 
shown to enhance MHV RNA synthesis (Shi et al., 2003). 

Other members of the hnRNP family have also been implicated in 
MHV RNA synthesis. Pyrimidine tract-binding protein (PTB, also 
known as hnRNP I) was shown to bind to pentanucleotide repeats 
upstream of the positive-strand leader copy of the TRS (Li et al., 
1999). In addition, PTB bound the negative strand of the 3’ UTR, 
specifically at the complement of the invariant octanucleotide motif 
(Huang and Lai, 1999). The positive strand of the same region of the 
3’ UTR was also bound by hnRNP A1, and deletions in this region 
inhibited DI RNA synthesis (Huang and Lai, 2001). Another hnRNP, 
synaptotagmin-binding cytoplasmic RNA-interacting protein (SYN- 
CRIP), was found to bind to both positive- and negative-strand MHV 
RNA near the region of the leader pentanucleotide repeats (Choi et al., 
2004). Moreover, RNAi-mediated downregulation of SYNCRIP delayed 
the kinetics of MHV RNA synthesis. In the BCoV 5’ UTR, multiple 
complexes of six proteins have been found to bind specifically to the 
stem-loop IV that is required for DI RNA replication (Raman and 
Brian, 2005). It is not yet clear whether some of these proteins are 
previously identified hnRNPs or whether they represent new cellular 
factors. 

In the 3’ UTR of MHV, a complex of proteins was found to bind to two 
similar 11-base motifs in positive-strand RNA, at distances of 26-36 
and 129-139 nucleotides from the poly(A) tail (Liu et al., 1997; Yu and 
Leibowitz, 1995a,b). DI RNAs with mutations in either of these ele- 
ments were defective in replication. The largest member of the protein 
complex was identified as mitochondrial aconitase, a protein not pre- 
viously known to have RNA-binding activity (Nanda and Leibowitz, 
2001). Other components of the complex were then found to be the 
chaperones HSP60, HSP40, and mitochondrial HSP70 (Nanda et al., 
2004). Although MHV replication does not have any known involve- 
ment with mitochondria, both mitochondrial aconitase and mitochon- 
drial HSP70 have substantial cytoplasmic fractions. Finally, at the 
furthest downstream ends of the genomes of MHV and BCoV, poly(A) 
binding protein binds to the poly(A) tail and appears to play a role 
in RNA synthesis beyond its function in translation (Spagnolo and 
Hogue, 2000). 

Among the array of candidate host factors in coronavirus RNA syn- 
thesis, it remains to be established which are essential and which play 
enhancing roles, either as RNA chaperones or in some other capacity. 
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Such assessments can be difficult, because many of these factors are 
critical or essential to normal cellular functions. Thus, the validation of 
host factors will likely require the establishment of an efficient in vitro 
RNA replication and transcription system, in which reconstitution of 
coronavirus RNA synthesis can be achieved from isolated components 
and precursors. 


VI. GENETICS AND REVERSE GENETICS 


Numerous classical coronavirus mutants have been isolated over the 
past 25 years, mainly with MHV (Lai and Cavanagh, 1997). Mutants 
were either identified as naturally occurring viral variants (often on 
the basis of causing atypical pathogenesis), or else they were obtained 
through selection criteria such as escape from neutralization by mono- 
clonal antibodies. A number of sets of MHV mutants were generated by 
chemical mutagenesis, followed by screening for temperature-sensitive 
phenotypes (Koolen et al., 1983; Martin et al., 1988; Robb et al., 1979; 
Schaad et al., 1990) or, in one case, for aberrant cytopathic effects or 
plaque morphologies (Sturman et al., 1987). Although the latter search 
yielded an unusually high proportion of structural protein mutants, 
viruses with conditionally lethal, RNA-negative phenotypes were the 
predominant isolates in all searches. The arrangement of the corona- 
virus genome dictates that the vast majority of randomly generated 
mutations will fall in the replicase gene, owing to its large target size. 
Despite assiduous efforts that applied classical genetic methods to the 
study of the replicase (Baric et al., 1990; Fu and Baric, 1992, 1994; 
Schaad et al., 1990), progress was limited by the technology available 
at the time, and exploitation of the full value of these mutants would 
await the development of reverse genetic techniques. 

The basic blueprint for positive-strand RNA virus reverse genetics— 
the transcription of infectious RNA from a full-length cDNA copy of the 
viral genome—was established more than two decades ago with polio- 
virus (Racaniello and Baltimore, 1981). It became possible only recently 
to apply this scheme to coronaviruses, however, owing to the need to 
surmount a number of formidable hurdles. Most notable were the ob- 
stacles posed by the huge sizes of coronavirus genomes and the high 
instabilities of various regions of the replicase gene when they were 
propagated as cloned cDNA in E. coli. The first reverse genetic sys- 
tem for coronaviruses, targeted RNA recombination, was developed to 
circumvent these barriers, at a time when it was far from clear whether 
the construction of full-length infectious cDNA clones would ever be 
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technically feasible (Masters, 1999; Masters and Rottier, 2005). This 
method, originally developed in MHV, takes advantage of the high rate 
of homologous RNA recombination in coronaviruses. A synthetic donor 
RNA bearing the mutation of interest is introduced into cells that have 
been infected with a recipient parent virus possessing some character- 
istic that can be selected against. Mutant recombinants that arise 
among progeny viruses are then identified by counterselection of the 
recipient parent virus. 

The earliest form of targeted RNA recombination employed, as the 
recipient parent virus, a classical MHV mutant that was thermolabile 
owing to an internal deletion in the N gene (Koetzner et al., 1992; 
Peng et al., 1995a), which is the 3’-most gene in the genome. Mutations 
were introduced into the N gene or the 3’ UTR by means of in vitro- 
synthesized donor RNAs corresponding to the smallest MHV sgRNA. 
Recombinants, which were identified as survivors of a heat-killing 
selection, had restored the region deleted in the parent virus and, 
concomitantly, had acquired marker mutations planted in the donor 
RNA. The efficiency of this system was subsequently increased by the 
incorporation of 5’-cis-acting elements that converted the donor RNA 
into a replicating DI RNA (Masters et al., 1994; van der Most et al., 
1992). The scope of this technique was then extended through the 
addition of 3’-contiguous genomic sequence to donor RNAs, ultimately 
allowing reverse-genetic access to all of the structural genes of MHV 
(Fischer et al., 1997a,b, 1998; Peng et al., 1995b). The strength and 
versatility of targeted RNA recombination were substantially enhanced 
as a result of the construction of the interspecies coronavirus mutant 
fMHV, a chimera in which the S protein ectodomain of MHV was re- 
placed by the S protein ectodomain from FIPV (Kuo et al., 2000). This 
replacement resulted in a virus that had acquired the ability to grow in 
feline cells and had simultaneously lost the ability to grow in murine 
cells. Although the immediate rationale for the creation of f{MHV was 
to dissect domain requirements for virion assembly (Section IV.B.2), it 
was readily apparent that this chimera offered a tremendous selective 
advantage in targeted RNA recombination. The use of f{MHV as the 
recipient parent virus allowed the selection of recombinants har- 
boring virtually any nonlethal MHV mutation in the 3’-most 10 kb of 
the genome, on the basis of their having regained the ability to grow 
in murine cells. Numerous mutants, many with extremely fragile phe- 
notypes, have since been obtained by this method (de Haan et al., 
2002a,b; Goebel et al., 2004a,b; Hurst et al., 2005; Kuo et al., 2002, 
2003). The generality of this host-range-based selection system has 
been established by the extension of the method to another strain of 
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MHV (Ontiveros et al., 2001) and by use of an analogous chimera, 
mFIPV, for the construction of FIPV mutants (Haijema et al., 2003, 
2004). 

Despite its value, however, targeted RNA recombination can be used 
to engineer only the downstream one-third of the genome. The com- 
plete extent of reverse genetics did not become available to coronavirus 
research until relatively recently. Through the exceptional persever- 
ance and inventiveness of three independent laboratories, systems 
based on full-length cDNA clones have been developed, each using 
a different strategy to overcome the stability problems inherent to 
coronavirus cDNA. These systems all provide a capability of great 
importance that is effectively beyond the scope of targeted RNA recom- 
bination: access to the replicase gene. In the first such method 
(Enjuanes et al., 2005), a full-length cDNA copy of the TGEV genome 
was assembled in a low copy-number bacterial artificial chromosome 
(BAC) vector. Infectious coronavirus RNA was produced in this system 
by a “DNA-launch,” in vivo nuclear transcription by host RNA poly- 
merase II from an engineered CMV promoter (Almazan et al., 2000). 
The DNA launch ensured complete capping of the viral RNA, and it 
bypassed potential limitations of the system arising from the efficiency 
of in vitro transcription of genomic RNA. Heterologous sequence was 
removed from the 3’ end of the transcribed RNA through the action of 
an incorporated hepatitis delta virus ribozyme. Further stabilization 
of the full-length BAC clone in bacteria was achieved through the 
insertion of a eukaryotic intron into either of two positions in 
the mapped toxic region of the TGEV cDNA (Gonzalez et al., 2002). 
This allowed stable propagation of the BAC for over 200 bacterial 
generations. 

In the second method, full-length genomic cDNAs were assembled 
by in vitro ligation of smaller, more stable subcloned cDNAs (Baric and 
Sims, 2005). Infectious RNA was then transcribed in vitro from the 
ligated product. The boundaries of the subcloned genomic cDNA frag- 
ments were chosen so as to allow ease of manipulation for site-directed 
mutagenesis applications. Most importantly, some fragment bound- 
aries were arranged in such a way as to interrupt regions of cloned 
cDNA instability. This is essentially the same scheme that had been 
earlier used to produce infectious RNA for yellow fever virus, a flavivi- 
rus (Rice et al., 1989). However, for coronaviruses, the scheme had to 
be executed on a much grander scale, with five to seven fragments 
instead of two. To facilitate this approach, the innovation was intro- 
duced of directing the unique assembly of fragments by means of 
nonsymmetric overhangs generated by restriction enzymes that cut 
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at a distance from their recognition sequences. This ensured that the 
fragments became connected in a predetermined order by ligation, 
without the generation of rearranged byproducts. Originally demon- 
strated with TGEV (Yount et al., 2000), this in vitro assembly tech- 
nique has subsequently been successfully used to engineer the 
genomes of MHV (Yount et al., 2002), SARS-CoV (Yount et al., 2003), 
and IBV (Youn et al., 2005). 

In the third method, entire coronavirus cDNAs, generated by long- 
range RT-PCR (Thiel et al., 1997), were inserted into a unique restric- 
tion site in the genome of vaccinia virus (Thiel and Siddell, 2005). In 
this scheme, vaccinia virus served as a huge cloning vehicle, in 
which the coronavirus genome cDNAs did not exhibit the instabilities 
encountered in E. coli plasmids. Infectious RNA was produced by 
in vitro transcription from purified vaccinia virus DNA (Thiel et al., 
2001a). Alternatively, a DNA launch was carried out in vivo with 
transfected cDNA and fowlpox-encoded T7 RNA polymerase (Casais 
et al., 2001). The use of vaccinia as a vector has allowed manipulation 
of the resulting cloned cDNA by any among the suite of methods that 
have been developed for poxvirus reverse genetics. In particular, tran- 
sient dominant selection has been used to carry out site-directed mu- 
tagenesis (Britton et al., 2005). Engineered mutations have also been 
directly recombined from PCR products into vaccinia clones, through 
exploitation of both negative and positive selection of a gpt cassette 
(Coley et al., 2005). A further innovation came from the rescue of 
recombinant coronaviruses from cell lines expressing N protein, given 
that N protein has been shown to greatly enhance recovery of virus in 
all three full-length cDNA systems (Almazan et al., 2004; Schelle et al., 
2005; Thiel et al., 2001a; Yount et al., 2002). This poxvirus-vectored 
technique was originally applied to HCoV-229E (Thiel et al., 2001a), 
and it has since been used to engineer the genomes of IBV (Casais 
et al., 2001) and MHV (Coley et al., 2005). 

The two main options for reverse genetic systems both have their 
own relative advantages. For reverse genetic studies involving corona- 
virus structural genes or the 3’ UTR, targeted RNA recombination is 
currently the easier system to manipulate, and it has the power to 
recover extremely defective mutants. Another asset of targeted RNA 
recombination is that it lends itself well to studies involving domain 
exchange between different proteins (Peng et al., 1995b) or the ex- 
change of genomic elements (Hsue and Masters, 1997). In these cases, 
the system, through its own selection of allowable crossover sites, can 
reveal which substitutions retain functionality and which are lethal. 
On the other hand, full-length cDNA reverse-genetic strategies provide 
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the capacity to site-specifically mutagenize the exceedingly large viral 
RNA replicase gene. This advantage is just beginning to be exploited, 
and it can be expected to play a major role in the future in the acquisition 
of an understanding of the workings of the complex RNA synthesis 
machinery. In addition to molecular biological studies, coronavirus 
reverse-genetic investigations have opened the door to the develop- 
ment of these viruses, and their derivative replicons, for vaccines 
(Alonso et al., 2002; Haijema et al., 2004), expression systems (de Haan 
et al., 2003b, 2005), and gene delivery vectors (Thiel et al., 2001b, 
2003b). 
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ABSTRACT 


Chlorella viruses or chloroviruses are large, icosahedral, plaque- 
forming, double-stranded-DNA-containing viruses that replicate in 
certain strains of the unicellular green alga Chlorella. DNA sequence 
analysis of the 330-kbp genome of Paramecium bursaria chlorella virus 
1 (PBCV-1), the prototype of this virus family (Phycodnaviridae), pre- 
dict ~366 protein-encoding genes and 11 tRNA genes. The predicted 
gene products of ~50% of these genes resemble proteins of known 
function, including many that are completely unexpected for a virus. 
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In addition, the chlorella viruses have several features and encode 
many gene products that distinguish them from most viruses. These 
products include: (1) multiple DNA methyltransferases and DNA site- 
specific endonucleases, (2) the enzymes required to glycosylate their 
proteins and synthesize polysaccharides such as hyaluronan and chitin, 
(3) a virus-encoded K* channel (called Kev) located in the internal 
membrane of the virions, (4) a SET domain containing protein (referred 
to as vSET) that dimethylates Lys27 in histone 3, and (5) PBCV-1 
has three types of introns; a self-splicing intron, a spliceosomal pro- 
cessed intron, and a small tRNA intron. Accumulating evidence 
indicates that the chlorella viruses have a very long evolutionary 
history. This review mainly deals with research on the virion structure, 
genome rearrangements, gene expression, cell wall degradation, poly- 
saccharide synthesis, and evolution of PBCV-1 as well as other related 
viruses. 


I. INTRODUCTION 


Members, including the chlorella viruses, and prospective members 
of the family Phycodnaviridae constitute a genetically diverse, but 
morphologically similar, group of viruses with eukaryotic algal hosts 
from both fresh and marine waters. The family name derives from two 
distinguishing characteristics: (1) “phyco” from their algal hosts and 
(2) “dna” because all of these viruses have dsDNA genomes (Wilson 
et al., 2005b). The phycodnaviruses have some of the largest virus 
genomes known, ranging in size from ~170 to 560 kb and contain 
several hundred protein-encoding genes. 

The phycodnaviruses are among the virioplankton recognized as 
important ecological elements in aqueous environments. They, along 
with other viruses, play important roles in the dynamics of algal blooms, 
nutrient cycling, algal community structure, and possibly gene transfer 
between organisms. The discovery phase of aquatic viruses, including 
the phycodnaviruses, is just beginning with new viruses continually 
being discovered as more environmental samples are examined. Ongo- 
ing metagenomic studies involving massive DNA sequencing reveal a 
greater viral diversity than could have been imagined just a few years 
ago (Hambly and Suttle, 2005; Wommack and Colwell, 2000). The ge- 
netic diversity that exists in the phycodnaviruses, albeit with only a 
few genomes sequenced, is enormous. To illustrate this diversity, virus- 
es in three genera of the phycodnaviruses have been sequenced and 


CHLORELLA VIRUSES 295 


each of these viruses encodes several hundred genes. However, only 14 
of these genes are common to all three viruses (Dunigan et al., 2006). 
Thus, there are more than 1000 unique genes in just these three 
viruses. 

Accumulating evidence also indicates that the phycodnaviruses are 
probably very old viruses. The phycodnaviruses together with the 
poxviruses, iridoviruses, asfarviruses, and the recently discovered 
1.2-Mb Mimivirus probably have a common evolutionary ancestor, 
perhaps arising at the time eukaryotes separated from prokaryotes, 
approximately 3 billion years ago (Raoult et al., 2004; Villarreal, 2005; 
Villarreal and DeFilippis, 2000). All of these viruses share 9 gene 
products, and 33 more gene products are present in at least two of 
these five viral families (Iyer et al., 2001; Raoult et al., 2004). Collec- 
tively, these viruses are referred to as nucleocytoplasmic large DNA 
viruses (NCLDV) (Iyer et al., 2001). 

This review focuses on the chlorella viruses that constitute one genus 
in the family Phycodnaviridae. Phycodnaviruses are large (mean diam- 
eter of 160 + 60 nm) icosahedrons and, where known, the viruses have 
an internal membrane that is required for infection. Phylogenetic ana- 
lyses of the 6-DNA polymerases from the phycodnaviruses indicate that 
they are more closely related to each other than to other dsDNA viruses 
and that they form a monophyletic group, consistent with a common 
ancestor (Wilson et al., 2005b). However, the viruses fall into six clades 
which correlate with their hosts and each has been given genus status. 
Often the genera can be distinguished by additional properties for 
example, lytic versus lysogenic life styles or linear versus circular gen- 
omes (Wilson et al., 2005b). Members of the genus Chlorovirus (chlorella 
viruses) infect fresh water algae, whereas members of the other five 
genera (Coccolithovirus, Phaeovirus, Prasinovirus, Prymnesiovirus, 
and Raphidovirus) infect marine algae. 

The type chorella virus is Paramecium bursaria chlorella virus 1 
(PBCV-1). Because there have been several reviews on the phycodna- 
viruses and the chlorella viruses (Dunigan et al., 2006; Kang et al., 
2005; Van Etten, 2003; Van Etten and Meints, 1999; Van Etten 
et al., 1991, 2002), this review deals mainly with research on virion 
structure, infection cycle, genome rearrangements, gene expression, cell 
wall degradation, and polysaccharide synthesis. Additional information, 
including a complete list of chlorella virus publications and additional 
images of the viruses, is available on the “World of Chlorella Viruses” Web 
site at: http://www.ianr.unl.edu/plantpath/facilities/Virology/index.htm/. 
The general history of the algal viruses and ecological aspects of these 
fascinating viruses can be found in other reviews (Brussaard, 2004; 
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Dodds, 1979; Fuhrman, 1999; Lemke, 1976; Müller et al., 1998; Suttle, 
2000, 2005; Wommack and Colwell, 2000). 


II. RESEARCH History, CLASSIFICATION, AND SPECIFIC FEATURES 


A. History and Classification of Viruses and their Hosts 


In 1978, Kawakami and Kawakami (1978) described the appearance of 
large (~180 nm in diameter) icosahedral, lytic viruses in zoochlorellae 
of the protozoan Paramecium bursaria (designated zoochlorella cell virus 
[ZCV]) after the algae were released from the paramecium. No virus 
particles were detected in zoochlorellae growing symbiotically inside the 
paramecium cells, although ZCV particles were present in the depressions 
of the pellicle, between the cilia, and in the food vacuole of the paramecium. 
ZCV infected the zoochlorella by adsorption and digestion of the cell wall 
and virus particles accumulated in the algal cytoplasm. After cell lysis, 
progeny viruses were released into the medium. 

Independently, a few years later, similar lytic viruses were described 
in zoochlorellae isolated from the green coelenterate Hydra viridis 
(Meints et al., 1981; Van Etten et al., 1981) and also from P. bursaria 
(Van Etten et al., 1982). A laboratory infection system for these viruses 
was developed using exsymbiotic zoochlorella strains as hosts, includ- 
ing Chlorella NC64A that was originally isolated from a P. bursaria 
(Van Etten et al., 1983a). This system allows chlorella viruses to be 
produced in large quantities and the viruses can be assayed by plaque 
formation using standard bacteriophage techniques (Van Etten 
et al., 1983a). Since these early studies, literally hundreds of chlorella 
viruses have been isolated from natural sources. 

Chlorella viruses included in the genus Chlorovirus (Wilson et al., 
2005b) currently consist of three species: (i) Viruses that infect 
Chlorella NC64A (NC64A viruses). (ii) Viruses that infect Chlorella 
Phi (Pbi viruses). NC64A viruses neither infect nor attach to Chlorella 
Pbi, and vice versa. (iii) Viruses that infect symbiotic zoochlorella in 
the coelenterate Hydra viridis. Hydra zoochlorella have not been 
cultured free of virus, so these viruses can only be isolated from 
chlorella cells freshly released from hydra. Recently, a virus that in- 
fects zoochlorella of the heliozoon Acanthocystis turfacea was described 
(Bubeck and Pfitzner, 2005). These viruses, designated ATCV-1 and 
ATCV-2, infect Chlorella SAG 3.83, a symbiont of A. turfacea but do not 
infect either Chlorella SAG 211-6, a host for the NC64A viruses, 
or Chlorella SAG 241-80, a host for the Pbi viruses. 
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NC64A viruses have been isolated from fresh water collected in 
the United States (Van Etten et al., 1985a), South America, Japan 
(Yamada et al., 1991), China (Zhang et al., 1988), South Korea (Cho 
et al., 2002), Australia, Israel, and Italy (Van Etten et al., 2002). Phi 
viruses initially were discovered in fresh water collected in Europe 
(Reisser et al., 1988) and more recently in water collected in Australia, 
Canada, and the northern United States or at higher elevations in 
the western United States (Van Etten, J. L., and Nelson, M., unpub- 
lished results). The most important factors influencing the distribution 
of NC64A and Pbi viruses are probably latitude and altitude. Chlorella 
NC64A and Chlorella Phi were originally isolated from American and 
European isolates of P. bursaria, respectively. The component sugars 
in the cell walls of Chlorella NC64A and Chlorella Pbi differ consider- 
ably (Kapaun et al., 1992). Because the viruses can distinguish the two 
chlorella isolates, it seemed likely that the host receptor for the viruses 
might also serve as the recognition factor for becoming a symbiont in 
the paramecia. However, Chlorella NC64A and Chlorella Pbi each 
established a stable symbiotic relationship with both American and 
European isolates of P. bursaria (Reisser et al., 1991). 

18S rRNA sequence analyses of zoochlorella from American and 
European paramecia have been conducted (Hoshina et al., 2004, 
2005). Zoochlorella 18S rRNAs separate into two lineages; NC64A 
(USA), Syngen 2-3 (USA), Cs2 (China), MRBGI1 (Australia), and 
strains from Japan belong to the American type, whereas PB-SW1 
(Germany) and CCAP 1660/11 (UK) strains belong to the European 
type. The American type symbionts have three group-I introns in the 
18S rRNA genes, whereas a single group-I intron, located at a different 
position, exists in the European symbionts. Likewise, the 185 rRNA 
sequence distinguishes the two groups: (1) Cs2, MRBG1, and strains 
from Japan and (2) PB-SW1 and CCAP 1660/11. The American type 
and European type of zoochlorellae correspond to the hosts for NC64A 
viruses and Pbi viruses, respectively. It will be interesting to deter- 
mine where Chlorella SAG3.83, which is the host for new chlorella 
viruses, fits into this scheme (Bubeck and Pfitzner, 2005). 


B. Specific Features of the Chlorella Viruses 


The chlorella viruses have many interesting properties. Most of 
these studies have been conducted on PBCV-1 and its related NC64A 
viruses such as chlorella virus Kyoto 2 (CVK2) (Van Etten and Meints, 
1999; Van Etten et al., 1991; Yamada et al., 1991). Some of these 
features are summarized as follows: 
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i. The virus genomes are large linear dsDNAs (315-380 kbp) 
with cross-linked hairpin termini. 

il. The viruses infect host cells by a bacteriophage-like mechanism. 

iii. Chlorella viruses often encode multiple type II DNA methyl- 
transferases and DNA site-specific (restriction) endonucleases 
(Nelson et al., 1993, 1998). 

iv. Unlike other glycoprotein-containing viruses, chlorella viruses 
encode most, if not all, of the components required to glycosy- 
late their proteins (Graves et al., 2001; Markine-Goriaynoff 
et al., 2004; Wang et al., 1993). 

v. Chlorella viruses were the first viruses discovered to have 
more than one type of intron: PBCV-1 has a self-splicing in- 
tron in a transcription factor TFIIS-like gene (Li et al., 1995; 
Yamada et al., 1994), a spliceosomal-processed intron in its 
DNA polymerase gene (Grabherr et al., 1992; Zhang et al., 
2001), and a small intron in one of its tRNA genes (Nishida 
et al., 1999a). 

vi. Many chlorella virus encoded proteins are either the smallest 
or among the smallest proteins of their class and some may 
represent the minimal catalytic unit. 

vii. Some translational components, such as tRNAs (Li et al., 1997; 
Nishida et al., 1999a), and elongation factor EF-3 (Yamada et al., 
1993) are virus encoded and expressed during the infection cycle. 

viii. Chlorella viruses encode a functional K* channel protein called 
Kev. Kev was the first virus encoded K* channel to be discov- 
ered, and it is the smallest protein known to form a functional 
K* selective channel (Gazzarrini et al., 2003; Kang et al., 2003; 
Plugge et al., 2000). 


III. VIRION STRUCTURE 


PBCV-1 virions have a sedimentation coefficient of about 2300 S 
in sucrose density gradients (Van Etten et al., 1983b) and an estimated 
molecular mass of 1 x 10° Da (Yonker et al., 1985). The virion contains 
64% protein, 21-25% DNA, and 5-10% lipid (Skrdla et al., 1984). 
The PBCV-1 virion contains more than 100 virion-encoded proteins 
(Skrdla et al., 1984; Dunigan, D. D. e¢ al., unpublished results) includ- 
ing two DNA restriction endonucleases, DNA binding proteins and 
protein kinases (Yamada et al., 1996). The PBCV-1 54 kDa major 
capsid protein Vp54 is a glycoprotein and comprises ~40% of the virus 
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protein. Vp54 has been crystallized and consists of 2 eight-stranded, 
antiparallel 6-barrel, “jelly-roll” domains related by a pseudo-sixfold 
rotation (Nandhagopal e¢ al., 2002). 

Ultrastructural studies have been conducted on both intact and dis- 
rupted chlorella virus virions using either negative staining (Becker 
et al., 1993) or cryo-electron microscopy with three-dimensional image 
reconstructions (26 A resolution) (Yan et al., 2000). The latter study was 
complemented and extended by fitting the structure of the major capsid 
protein Vp54 to the cryo-electron microscopy density maps of PBCV-1 
(Simpson et al., 2003). The outer glycoprotein capsid is icosahedral and 
surrounds a lipid bilayer membrane. The membrane is connected to the 
outer shell by regularly spaced proteins. Disruption of the membrane 
destroys PBCV-1 infectivity (Skrdla et al., 1984). The outer diameter of 
the viral capsid varies from a minimum of 1650 A along the two- and 
threefold axes to a maximum of 1900 A along the fivefold axes. The 
capsid shell consists of 1680 donut-shaped trimeric capsomers plus 12 
pentameric capsomers at each icosahedral vertex. The trimeric cap- 
somers are arranged into 20 trisymmetrons (each containing 66 tri- 
mers) and 12 pentasymmetrons (each containing 30 trimers and 1 
pentamer at the icosahedral vertices) (Fig. 1). Assuming all the trimeric 
capsomers are identical, the outer capsid of the virus contains 5040 
copies of the major capsid protein Vp54. The triangulation number (T) 
for the virus is 169 (h = 7, k = 8) and the virus has a right handed, skew 
class of T lattice (Caspar and Klug, 1962). 

Most of the trimeric capsomers have a central, concave depression 
surrounded by three protuding towers. The trimeric capsomers are 
72 A in diameter and ~75 A high. The capsomers interconnect at their 
bases in a contiguous shell that is 20-25 A thick. Twelve pentamer 
capsomers, each ~70 A in diameter, exist at the virus fivefold vertices 
and probably consist of a different protein. Each pentamer has a cone- 
shaped axial channel at its base. One or more proteins appear below 
the axial channel and outside the inner membrane (Fig. 1B). This 
protein(s) may be responsible for digesting the host cell wall during 
infection. Presumably contact between the virus and its host receptor 
alters the channel sufficiently to release the wall-degrading enzyme(s). 

Complementary information about the PBCV-1 virion structure was 
obtained by atomic force microscopy (AFM) (Kuznetsov et al., 2005). 
Since AFM is not dependent on symmetry averaging as is cryo-electron 
microscopy, it can reveal unique properties of individual particles. 
From the response of the AFM tip in contact with the particles, the 
virus particles appear somewhat soft and are readily deformed. The 
individual trimeric capsomers appear to have a small hole in their 
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Fic 1. Three-dimensional image reconstruction of chlorella virus PBCV-1 from cryo- 
electron micrographs and atomic force micrographs. (A) The virion capsid consists of 12 
pentasymmetrons and 20 trisymmetrons. Five trisymmetrons are highlighted in the 
reconstruction (blue) and a single pentasymmetron is colored yellow. A pentavalent 
capsomer (white) lies at the center of each pentasymmetron. Each pentasymmeton 
consists of one pentamer plus 30 trimers. Eleven capsomers form the edge of each 
trisymmetron (black dots) and therefore each trisymmetron has 66 trimers (Yan et al., 
2000). (B) Dense material (blue arrow) (cell wall digesting enzyme(s)?) is present at each 
vertex (red arrow) between the vertex and the membrane. (C—F) Atomic force microscopy 
images of the surfaces of PBCV-1 virions showing the pentameric arrangements of 
proteins around the fivefold vertices, with a unique protein on the vertex (Kuznetsov 
et al., 2005). 


CHLORELLA VIRUSES 301 


center and a distinctive triangular shape that is more angular and 
accentuated than the “doughnut” shape deduced from the cryo- 
electron microscopy (Yan et al., 2000). The pentagonal vertices are 
formed by five copies of a different protein, and this has yet another 
unique protein in its center (Fig. 1D and E). The central protein 
exhibits some unusual behavior when subjected to AFM tip pressure; 
it disappears into the virion interior, leaving a distinct hole. When the 
AFM tip pressure is decreased, it returns to its original position. Virion 
degradation is accompanied by the appearance of many small, 
uniform, spherical, and virus-like particles (VLP) consistent with 
T = 1 or 8 icosahedral products (Kuznetsov et al., 2005). 


IV. Virus LIFE CYCLE 


PBCV-1 infects its host by attaching rapidly, specifically, and irre- 
versibly to the external surface of the algal cell wall (Meints et al., 
1984). Attachment always occurs at a virus vertex, possibly with hair- 
like appendages (Van Etten et al., 1991) and is followed by degradation 
of the host wall at the attachment point. The determinants for host 
range are associated with attachment. Onimatsu et al. (2004) reported 
that a virion protein Vp130 from Chlorovirus CVK2 binds specifically 
to the host Chlorella cell wall. Vp130 is a homolog of PBCV-1 protein 
A140/145R and consists of 1126 amino acid residues (predicted mol wt, 
121,257 and pI, 10.76). The Vp130 N-terminus is blocked by some 
unknown structure and the C-terminus consists of 23 tandem PAPK 
repeats. Internally, Vp130 contains seven repeats of 70—73 amino 
acids, each copy of which is separated by several PAPK sequences. 
This protein is well conserved among the NC64A viruses. Because 
externally added Vp130 competes with CVK2 in binding to host cells, 
Vp130 is most likely a host-recognizing protein in the virion (Onimatsu 
et al., 2004). Immune electron microscopy using Vp130 antibody estab- 
lished that the protein is localized specifically at the vertices of the 
CVK2 virion (Onimatsu, H. et al., manuscript in preparation). 

Attachment of the virion to the host cell wall probably alters the 
virion structure slightly, allowing release of a virion-packaged wall 
digesting enzyme(s). Following host cell wall degradation, the internal 
membrane of the virus probably fuses with the host membrane result- 
ing in entry of the viral DNA and virion-associated proteins. An empty 
capsid is left on the cell surface. Infection results in rapid depolarization 
of the host membrane (Frohns et al., 2006; Mehmel et al., 2003), and 
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we hypothesize that this rapid depolarization is caused by a virus- 
encoded K* channel (called Kev) located in the internal membrane. 
This depolarization may aid in the release of DNA into the cell and/or 
limit subsequent infection by additional viruses. 

Circumstantial evidence indicates that the PBCV-1 DNA and sus- 
pected virion-associated proteins quickly move to the nucleus where 
early transcription is detected within 5-10 min postinfection (pi) 
(Kawasaki et al., 2004; Schuster et al., 1986). Experimental results 
indicate that host chromosomal DNA begins to be degraded, possibly 
due to two restriction endonucleases that are packaged in the PBCV-1 
virion, within minutes of infection (Agarkova, I. V. et al., submitted for 
publication; Dunigan, D. D. et al., unpublished results). 

In the immediate-early phase of infection, the host is reprogrammed 
to transcribe viral RNAs. Very little is known about how this occurs, 
but chromatin remodeling may be involved. PBCV-1 encodes a 119 
amino acid SET domain containing protein (referred to as vSET) that 
dimethylates Lys27 in histone 3 (Manzur et al., 2003). vSET is pack- 
aged in the PBCV-1 virion, and accumulating evidence indicates that 
vSET is involved in repression of host transcription following PBCV-1 
infection (Manzur, K. L. et al., manuscript in preparation). 

PBCV-1 DNA replication begins 60-90 min pi and is followed 
by transcription of late virus genes (Schuster et al., 1986; Van Etten 
et al., 1984). Ultrastructural studies of PBCV-1 infected chlorella 
suggest that the nuclear membrane remains intact, at least during 
the early stages of virus replication (Meints et al., 1986). However, a 
functional host nucleus is not required for virus replication since 
PBCV-1 can replicate, albeit poorly and with a small burst size, in 
UV-irradiated cells (Van Etten et al., 1986). Approximately 2-3 hpi, 
assembly of virus capsids begins in localized regions in the cytoplasm, 
called virus assembly centers, which become prominent at 3—4 hpi 
(Meints et al., 1986). By 5 hpi, the cytoplasm is filled with infec- 
tious progeny virus particles (~1000 particles/cell) and by 6-8 hpi 
localized lysis of the host cell releases progeny virions. Of the progeny 
released, 25-50% of the particles are infectious; that is, each infected 
cell yields ~350 plaque-forming units (PFU) (Van Etten et al., 1983b). 
Intact infectious PBCV-1 particles accumulate inside the host 
30-40 min before release. Other chlorella viruses have longer replica- 
tion cycles than PBCV-1. For example, NC64A virus NY-2A requires 
approximately 18 h for replication and consequently forms smaller 
plaques. 

Some virion proteins are processed by specific proteinase activities 
(Songsri et al., 1997). One of them is a signal peptidase-like activity 
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and removes the N-terminal 25-33 amino acids of target proteins that 
contain a highly hydrophobic sequence of 17 amino acid residues. 
Lysine with an acidic amino acid on the N-side always precedes 
the cleavage site. Proteins A168R, A203R, and A532L are targets of 
this activity. These results lead to the following questions: (1) what 
enzymes (either of viral or host origin) are responsible for processing 
the virion proteins? (2) When, where, and how does processing occur 
in the course of Chlorovirus replication? (3) What are the biological 
effects of processing? 

Thus, progress has been made on characterization of the viral struc- 
tural proteins as well as the whole virion architecture. However, many 
fundamental questions remain to be answered about the assembly of 
such a large, complex virus particles in the host cells. Like poxviruses, 
iridoviruses, and African swine fever virus (ASFV), Chlorovirus 
assembly occurs in localized regions in the cytoplasm, referred to as 
virus assembly centers (Van Etten et al., 1991). (1) Where and how is 
the virus assembly center determined? (2) What is the origin of the 
virus internal membrane? Is it derived from the ER, like other 
membrane-containing viruses (Cobbold et al., 1996; Wolf et al., 1998)? 
(3) What is the role of the membrane in assembling the virion (e.g., 
does it function as a scaffold)? (4) How is genomic DNA packaged into 
an empty preformed capsid? (5) How are the more than 100 component 
proteins specifically assembled into a virion? 


V. Virus GENE EXPRESSION: IMMEDIATE EARLY AND LATE 
GENE EXPRESSION 


The PBCV-1 genome does not encode either a recognizable RNA 
polymerase or RNA polymerase subunit (Van Etten and Meints, 
1999). The lack of a virus-encoded RNA polymerase suggests that the 
infecting viral DNA is targeted to the cell nucleus and that a host RNA 
polymerase initiates viral transcription, possibly in conjunction with 
virus-packaged transcription factors. Consistent with this possibility, 
PBCV-1 encodes at least four transcription factor-like elements, 
TFIIB, TFIID, TFIIS, and VLTF-2. PBCV-1 also encodes two enzymes 
involved in forming the mRNA cap structure, an RNA triphosphatase 
(Ho et al., 2001) and an RNA guanylyltransferase (Ho et al., 1996). 
However, there is no evidence that any of these proteins are packaged 
in the virion. The size, amino acid sequence, and biochemical proper- 
ties of the PBCV-1 capping enzymes resemble yeast capping enzymes 
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more than the multifunctional poxvirus and ASFV RNA capping 
enzymes (Gong and Shuman, 2002). PBCV-1 also encodes an active 
RNase III that presumably is involved in processing either virus 
mRNAs and/or tRNAs (Zhang et al., 2003). In addition, PBCV-1 
encodes two proteins that contain sequence elements of superfamily 
II helicases. Superfamily II helicases are involved in transcription 
(Tanner and Linder, 2001). 

Two studies (Schuster et al., 1986, 1990) which examined PBCV-1 
RNA synthesis reached the following conclusions: 


i. Viral infection rapidly inhibits host RNA synthesis. 
il. Chloroplast rRNAs but not cytoplasmic rRNAs are degraded 
beginning at about 30 min pi. 

iii. PBCV-1 transcription is temporally programmed, and the first 
transcripts appear within 5-10 min pi (immediate early tran- 
scription). A few, but not all, early virus transcripts are synthe- 
sized in the absence of de novo protein synthesis. The synthesis 
of late transcripts that occurs after 60—90 min pi (late transcrip- 
tion) requires translation of an early virus gene(s). 

iv. Early and late virus genes are interspersed throughout the 
PBCV-1 genome. 

v. Full length gene probes often hybridize to mRNA transcripts 
that are 40-60% larger than the gene itself. 

vi. Transcription mapping of seven PBCV-1 genes revealed that the 
virus has mRNAs with 5’ untranslated regions as small as 14 
nucleotides and as large as 149 nucleotides. Transcripts often 
extend beyond the translational stop codon (Graves and Meints, 
1992; Hiramatsu et al., 1999; Kawasaki et al., 2004; Schuster 
et al., 1990). 


A study (Kawasaki et al., 2004) has produced some new insights into 
immediate early expressed genes. They isolated and characterized 23 
Chlorovirus PBCV-1 and CVK2 genes expressed in host cells as early 
as 5-10 min pi. Some of these immediate early gene products resem- 
bled transcriptional factors and mRNA-processing proteins including, 
TFIIB, helicases, mRNA capping enzyme (RNA guanylyltransferase), 
nucleolin, and a bean transcription factor. Other immediate early 
genes encoded factors influencing translation such as possible amino- 
acyl tRNA synthetases, possible ribosomal proteins and unknown pro- 
teins. Enzymes involved in polysaccharide synthesis were also found. 
All of these gene transcripts had a poly(A) tail, which decreased in size 
by 20 min pi, possibly caused by an exonuclease. A typical TATA-box 
and a common 5/-ATGACAA element were present in the promoter 
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region of all 23 immediate early genes, which may be recognized by 
host RNA polymerase and transcription factors. As suggested by the 
presence of the poly(A) tail, all of the immediate early genes contain a 
typical poly(A) addition signal 5’-AATAAA-3’, 10-90 bp downstream of 
the translational stop codon. 

At 40 min pi, a dramatic change occurs in the transcription of the 
Chlorovirus genes. The immediate early genes gradually decreased in 
size, suggesting some weakening or cessation of poly(A) polymerase 
activity. Concurrently, some larger transcripts began to appear. These 
larger transcripts are due to readthrough from an upstream ORF 
and/or into a downstream ORF. These results indicate that promoter 
selection changes around 40 min pi by some mechanism, possibly in- 
volving regulatory proteins encoded by some of the immediate early 
transcripts. In other organisms, transcription termination signals in a 
gene are recognized by a series of RNA-binding proteins and RNA- 
processing enzymes including cleavage stimulation factor F (CstF), 
cleavage and polyadenylation specific factor (CPSF), and poly(A) poly- 
merase; but these factors may not function well after 40 min pi, result- 
ing in elongated or unprocessed transcripts. Once in the cytosol, the 
poly(A) tail of mRNA is gradually shortened by an exonuclease (dead- 
enylation nuclease called DAN) that digests the tail in the 3’—5’ direc- 
tion. Once the size of the poly(A) tail reaches a critical threshold, the 
mRNA 5’ cap is removed (decapping) and the RNA is rapidly degraded. 
Therefore, a 3/-extension of each transcript by readthrough might 
serve as an alternate way to protect a coding region from degradation 
by 3/-exonuclease. This is a unique feature of chloroviruses contrasting 
to similar large viruses such as vaccinia and ASFV, both of which 
encode a functional poly(A) polymerase and mRNAs are polyadenylated 
(Johnson et al., 1993; Yanez et al., 1995). 

A poly(A) tail is also involved in initiation of translation. Efficient 
translation requires the mRNA poly(A) tail to bind to poly(A)-binding 
proteins, which, in turn, interact with translation initiation factor eIF- 
4G. Poly(A)-deficient mRNAs formed 40 min pi might require an inter- 
nal ribosome entry site (IRES)-dependent mechanism in order to be 
translated (Sachs, 2000). 

A conserved nucleotide sequence has been identified in the promoter 
region of genes expressed late in PBCV-1 infected cells. Kang et al. 
(2004a) reported that an AAAAATAnTT element or a subset of this 
sequence is located 6—30 nucleotides upstream of the ATG start codon 
of seven late-expressed PBCV-1 genes. 

However, many fundamental questions regarding Chlorovirus gene 
transcription remain to be answered including: (1) what kind of RNA 
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polymerase and its related factors are involved? (2) How are transcrip- 
tion initiation and termination regulated? (3) What mechanism 
switches transcription from early to late? (4) What DNA elements are 
responsible for the promoter function? (5) What trans-acting factors 
are involved in the regulation? 


VI. Virus PROTEIN SYNTHESIS, MODIFICATION, AND DEGRADATION 


Some early virus-encoded proteins appear within 15 min pi. Since 
cycloheximide, but not chloramphenicol, inhibits viral replication, 
PBCV-1 proteins are synthesized on cytoplasmic ribosomes and not orga- 
nellar ribosomes (Skrdla et al., 1984). How the virus takes over the host 
translational machinery forcing it to translate virus mRNAs is unknown. 
However, some of the factors involved in this process are expected to be 
virus encoded. Therefore, studying the chlorella virus system may reveal 
new insights into the regulation of eukaryotic protein synthesis. 

The chlorella viruses are the first known viruses to encode a trans- 
lation elongation factor (EF) (Yamada et al., 1993). The gene for a 
putative EF-3 is highly conserved in all Chlorovirus isolates examined 
so far. The EF-3 proteins from CVK2 and PBCV-1 (94% amino acid 
identity) have ~45% amino acid identity to an EF-3 protein from fungi 
(Belfield and Tuite, 1993; Chakraburtty, 2001). The fungal protein 
stimulates EF-1la-dependent binding of aminoacyl-tRNA to the A site 
of the ribosome. Like fungal EF-3 proteins, the CVK2 and PBCV-1 
proteins have an ABC transporter family signature and two ATP/ 
GTP binding-site motifs. 

PBCV-1 and CVK2 codon usages are biased to codons ending in XXA/U 
(63%) over those ending in XXC/G (37%) (Nishida et al., 1999a; Schuster 
et al., 1990). This bias is expected because PBCV-1 DNA is 40% G+ C 
(CVK2, 41% G+ C), whereas host nuclear DNA is 67% G + C (Van Etten 
et al., 1985b). Therefore, finding that PBCV-1 encodes 11 tRNA genes 
may not be surprising: 3 for Lys, 2 each for Asn and Leu, and 1 each 
for Ile, Tyr, Arg, and Val. Similarly CVK2 encodes 14 tRNA genes: 
3 for Lys, 2 each for Asn and Leu, and 1 each for Arg, Asp, Gly, Gln, 
Ile, Tyr, and Val (Nishida et al., 1999a). None of the tRNAs have a CCA 
sequence encoded at the 3’ end of the acceptor stem. Typically these 
three nucleotides are added separately to tRNAs. Some chlorella virus- 
es encode as many as 16 tRNAs (Cho et al., 2002; Nishida et al., 1999a). 
There is a strong correlation between the abundance of virus encoded 
tRNAs and the virus gene codon use (Lee et al., 2005; Nishida et al., 
1999a). 


CHLORELLA VIRUSES 307 


The virus encoded tRNAs contain internal A and B boxes character- 
istic of RNA polymerase III promoter elements, suggesting the tRNAs 
might be transcribed individually by RNA polymerase III (Nishida 
et al., 1999a). However, the tRNA genes are transcribed as a large 
precursor RNA and processed via intermediates to mature tRNAs at 
both early and late stages of virus replication. Some, if not all, of the 
tRNAs are aminoacylated in vivo, suggesting they probably function in 
viral protein synthesis (Nishida et al., 1999a). Possibly, the virus- 
encoded EF-3 in combination with the virus encoded tRNAs alter the 
host protein synthetic machinery to preferentially translate viral 
mRNAs. 

PBCV-1 has several genes encoding proteins that are involved in 
posttranslational modification. In addition to putative glycosyltrans- 
ferases (see in a later section), PBCV-1 encodes 7 Ser/Thr-protein ki- 
nases (Valbuzzi, P. et al., unpublished results), one putative Tyr-protein 
kinase and a putative Tyr phosphatase. Three protein kinases are 
packaged in the chlorella virus CVK2 virion (Yamada et al., 1996). 
PBCV-1 also encodes several enzymes involved in posttranslational 
modification, such as an ERV/ALR protein, which functions as a protein 
thiol oxidoreductase (Senkevich et al., 2000), a putative protein disul- 
fide isomerase and a prolyl 4-hydroxylase that converts Pro-containing 
peptides into hydroxyl-Pro-containing peptides (Eriksson et al., 1999). 
Moreover, PBCV-1 encodes two putative proteins that interact with 
ubiquitin, a ubiquitin C-terminal hydrolase and a Skp1 protein. Skp1 
proteins belong to the SCF-E3 ubiquitin ligase family that targets cell 
cycle proteins and other regulatory factors for degradation (Deshaies, 
1999). Finally, PBCV-1 encodes at least one putative serine proteinase. 


VII. DIVERSITY oF CHLORELLA VIRUS GENOMES 


The PBCV-1 genome is a linear 330,744-bp nonpermuted dsDNA 
molecule with covalently closed hairpin termini (Rohozinski et al., 
1989). The termini consist of 35-nucleotide-long covalently closed 
hairpin loops that exist in one of two forms; the two forms are comple- 
mentary when the 35-nucleotide sequences are inverted (flip-flop) 
(Zhang et al., 1994). Identical 2221-bp inverted repeats are adjacent 
to each hairpin end (Strasser et al., 1991). The remainder of the 
PBCV-1 genome contains primarily single-copy DNA (Girton and Van 
Etten, 1987). These features can be compared with those of other 
chloroviruses because the genomic sequences of three more chlorella 
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viruses have been completed—viruses NY-2A and AR158 which, like 
PBCV-1, infect Chlorella NC64A and virus MT325 which infects Chlo- 
rella Pbi. These sequences are available on the Web site http://green- 
gene.uml.edu, which represents work in progress; the sequences have 
not yet been published or deposited in the public databases. Some 
comparative data are listed in Table I. PBCV-1 and all other NC64A 
virus genomes are ~40% G + C, which is significantly lower than the 
67% G + C of the host Chlorella NC64A nuclear DNA (Van Etten et al., 
1985b). Chlorovirus MT325 that is a Pbi virus has a slightly higher G 
+ C content (~45%). The newly sequenced genomes vary from 315 
(MT325) to 369 kbp (NY-2A), reflecting a difference in the number of 
genes (~330-—~400). 

Some of the additional proteins encoded by NY-2A genes include 
ubiquitin, chitin synthase, N-acetylglucosaminy] transferase, 6 trans- 
posases, and 43 homing endonucleases. Furthermore, inteins exist in 
two of the NY-2A gene products, the a-subunit of ribonucleotide reduc- 
tase and a putative helicase (Fitzgerald, L. A. et al., manuscript in 
preparation). Not all chlorovirus genes are required for virus replica- 
tion in the laboratory. For example, extended deletions can occur in the 
chlorovirus genomes; 27—37-kbp deletions in PBCV-1 (Landstein et al., 
1995) and 30—42-kbp deletions in CVK1 (Songsri et al., 1995) are 
located in the left terminus of the genome. A detailed comparison of 
the gene contents between viruses should identify a set of highly 
conserved genes and variable or dispensable genes. 





TABLE I 
COMPARISON OF THE GENOME ORGANIZATION AMONG SEVERAL CHLORELLA VIRUSES 
Size of 
terminal No. of No. of 
G+C inverted protein- tRNA 
Genome content repeats encoding encoding 
Viruses size (bp) (%) (kbp) genes genes 
PBCV-1 330,744 39.6 2.2 367 11 
NY-2A 368,683" 40.3 2.4 ~420 7 
A158R 344,690“ 40.4 0.27 ~400 6 
MT325 314,335“ 44.9 ND ~330 10 
CVK2 ~350,000* ND 2.4 ND 14 


t Tentative size to be verified. 

* Tentative size based on pulsed-field gel electrophoresis (PFGE)-estimation. 

ND, Not determined. * Unpublished results, can be seen at Web site http://greengene. 
uml.edu. 
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A. Hairpin Ends and Inverted Terminal Repeats 


The 35-nucleotide sequence of the PBCV-1 hairpin end differs 
considerably from the hairpin loop sequence (43 nucleotides) of 
Chlorovirus CVK2 (Hiramatsu, S., and Yamada, T., unpublished re- 
sult) (Fig. 2). It is interesting that a single base change in the most 
distal position of the CVK2 sequence reduces the loop to 33 nucleo- 
tides. A region of 15-16 bp immediately adjacent to the hairpin end in 
the inverted terminal repeats also differs between PBCV-1 and CVK2; 
however beyond this region, the inverted repeat sequences are nearly 
identical between the two viruses. 

The terminal inverted repeats are 2.2-2.4 kbp in size (Table II). 
Although the inverted repeats of PBCV-1 and CVK2 are similar, this 
is not true for all viruses. The PBCV-1 inverted repeat region was 
hybridized to 36 other NC64A viruses. Twenty-eight hybridized very 
well to the probe, however, eight did not hybridize at all, indicating 
sequence differences. Such rearrangements may be mediated by 
numerous short repeated sequences that frequently occur at the junc- 
tions between the inverted repeats and the single copy region (Yamada 
and Higashiyama, 1993). Comparison of the nucleotide sequence of the 
inverted repeats among chloroviruses PBCV-1, CVK2, NY-2A, and 
AR158 indicated no significant identity in the terminal inverted 
repeats sequences except for occasional homology islands, a ~600-bp 
region next to the hairpin loop and a 400~600-bp region immediately 
adjacent to the single copy region (Nishida et al., 1999b). 


CVK2 


SS y << u 
GGGCCCTGTTTTTACHGTGGTCAACACATTYACCACTGTCAAT INT INT ht — Goacce 


AGCGCTCGTATATTGCCGTTTTAATATAGGCTA TCTACGCGCG 

TCGCGAGCATATAACGGCAAAATTATATCCGA TAGA TGCGCGC 
TAGGTTAGATAATCTITATAAATCGATTTACCGAC 
GTCGGTAAATCGATTTATAAAGATTATCTAACCTA 





PBCY-1 
GGGCCCTGTITITAC I GIGGTCAACACATTACCACTCTCAATI UTE bateeteebeteeaTIcC......cccccc 


— - - — 


Fic 2. Comparison of the nucleotide sequences at the PBCV-1 and CVK2 terminal 
hairpin ends. The most distal portions of the terminal inverted repeats (shown by 
arrows) are relatively similar between PBCV-1 and CVK2 DNAs (unidentical bases are 
shown in red), whereas the sequences of the covalently closed hairpin loops (35 nucleo- 
tides for PBCV-1 and 43 nucleotides for CVK2) are completely different. The hairpin 
loops exist in one of two forms (shown in blue and green); the two forms are complemen- 
tary when the sequences are inverted (flip-flop) (Zhang et al., 1994). 


OTE 


DNA replication, recombination and repair 
DNA polymerase (6) 
DNA primase 
ATP-dependent DNA ligase 
DNA topoisomerase II 


PCNA 


RNase H 


Replication factor C (Archae large subunit) 


Helicase (superfamily ITT) 


Exonuclease 


Pyrimidine dimmer-specific glycosylae 
DNA packaging ATPase 


TABLE II 


REPRESENTATIVE ORFs ENCODED BY CHLORELLA VIRUS PBCV-1* 


Signaling, ion channels and protein kinases 


A185R (918) Potassium ion channel protein A250R (94)! 
A468R (443) Ligand-gated channel protein A162L (411) 
A544R (298)' A163R (433) 


A583L (1061)' 


A193L (262) 
A57A4L (264) 
A399R (194) 
A417L (429) 


A456L (654) 
A166R (268)' 
A50L (141) 

A392R (258) 


DNA methyltransferase and site-specific endonucleases 


Adenine DNA methylase 


Cytosine DNA methylase 


A251R (326)' 
A581R (265)' 
A517L (344)' 


A530R (335) 


A683L (367) 


Ser/Thr protein kinase 


Tyr-protein kinase 
Tyr-protein phosphatase 


Nucleotide metabolism 
Aspartate transcabamylase 


Ribonucleotide reductase 
large subunit 


Ribonucleotide reductase 
small subunit 


Thioredoxin 


A34R (308) 
A248R (308)' 
A277L (303) 
A278L (610) 
A282L (569) 
A289L (283)' 
A614L (577)' 
AG617R (321) 
A305L (204) 


A169R (323)' 
AG629R (771) 


A476 (324) 


A427L (119) 


ITE 


DNA site-specific endonuclease 


Insertion and transposition 
Transposase 
Homing endonuclease (GIY-YIG) 


Homing endonuclease (HNH) 


Transcription 
Transcription factor IIB 
Transcription factor IID 
Transcription factor IIS 
VLTF2-type transcription factor 
RNA triphosphatase 
RNA guanylyltransferase 


A252R (342) 
A579L (183) 


A625R (433) 
A163L (165) 
A287R (251) 
A315L (246) 
A351L (358) 
A495R (221) 
A539R (173) 
A651L (230) 
A87R (456) 

A267L (314) 
A422R (342) 
A478L (310) 
A490L (310) 


A107L (290) 
A552R (270) 
A125L (180) 
A482R (215) 
A449R (193) 
A103R (330) 


Glutaredoxin 

dUTP pyrophosphatase 
dCMP deaminase 
Thymidylate kinase 
Nucleotide triphosphatase 
Cytidine deaminase 
Thymidylate synthase X 
ATPase 

ATPase 


Sugar manipulation enzymes 
Glucose dehydrogenase 
Glucosamine synthase 
Hyaluronan synthase 
Fucose synthase 


GDP-p-mannose 
dehydratase 


Mannosyltransferase 
Fucosyltransferase 
Glycosyltransferase 


A438 (78)' 
A551L (141) 
A596R (142)' 
A416R (188) 
A826L (209) 
A200R (118) 
AG674R (216)' 
A892R (258) 
A554L (271) 


A609L (389)' 
A100R (595) 
A98R (568) 

A295L (817) 
A118R (345) 


A64R (638) 
A114R (485) 
A111R (389) 
A222 /226R (432) 
A328L (355) 
A473L (517) 
A546L (321) 





(continues) 


GIE 


TABLE II (continued) 





Helicase (superfamily II) 


SWI/SNF helicase 

Skil helicase 

RNase III 

Histone H3, Lys 27 dimethylase 
Protein synthesis, modification, and degradation 

Translation elongation factor III 

Prolyl-4-hydroxylase 

Protein disulfide isomerase 

Thiol oxidoreductase 

Ubiquitin C-terminal hydrolyase 

SKP-1 protein 

Zn metallopeptidase 


A153R (459) 
A363R (811) 
A548L (458) 
A241R (725) 
A4G6A4R (245) 
A612L" 


A666L (918) 
A85R (242)' 
A448L (106) 
A465R (118) 
A105L (284) 
A89L (151) 

A604L (134) 


Cell wall digestion 
Chitinase 


Chitosanase 
G-1,3-Glucanase 
Polysaccharide lyase 


Lipid manipulation enzymes 
Glycerphosphory] diesterase 
2-Hydroxyacid dehydrogenase 
Lysophospholipase 
N-Acetyltransferase 


A181 /182R (830) 
A260R (484) 
A292L (328)' 
A94L (364) 
A215L (321) 


A49L (219) 
A53R (363) 
A271L (159) 
A654L (197) 


* ORFs for structural proteins including the major capsid protein Vp54 (A430L, 437 aa) and its related proteins are not listed. The 
number in brackets refers to the number of codons in the ORF. 
t A dagger means that the recombinant protein has been produced and shown to have the expected activity. 
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Yamada and Higashiyama (1993) detected a site-specific nick in the 
inverted repeat region of Chlorovirus, CVK1. This nick might serve as 
the initiation point for DNA replication. 


B. PBCV-1 Genes 


In the initial report describing the sequence of the PBCV-1 genome, 
702 ORFs of 65 codons or larger were identified and 377 of them were 
predicted to encode proteins (Kutish et al., 1996; Li et al., 1995, 1996; 
Lu et al., 1995, 1996). However, mistakes are being detected in the 
original sequence; often two adjacent ORFs consist of a single ORF. 
Currently we believe that PBCV has 691 ORFs of 65 codons or lar- 
ger, of which 366 are protein-encoding. PBCV-1 protein-encoding genes 
were identified initially by the following criteria: (1) A minimal size of 
65 codons initiated by an ATG codon. (2) The largest ORF was chosen 
when competing ORFs overlapped. (3) ORFs with AT-rich (>70%) 
sequences in the 50 nucleotides upstream of the putative initiation 
codons. To date, most of the protein-encoding genes have met these 
criteria. 

Unlike the poxviruses, in which genes near the terminal regions are 
transcribed toward the termini (Moss, 1996), the 366 PBCV-1 putative 
protein-encoding genes are evenly distributed on both strands and, 
with one exception, intergenic space is minimal. In fact, 275 ORFs 
are separated by less than 100 nucleotides. The exception is a 1788 
nucleotide sequence near the middle of the genome. This DNA region, 
which contains many stop codons in all reading frames, encodes 11 
tRNA genes. The 2.2-kb inverted terminal repeat region of the PBCV-1 
genome contains four ORFs, which are duplicated (Lu et al., 1995). 
Approximately 50% of the 366 PBCV-1 gene products have been tenta- 
tively identified, including some that seem irrelevant to virus replica- 
tion. Some PBCV-1 genes are closely related to genes of bacteria and 
their viruses, whereas other PBCV-1 genes appear eukaryotic in ori- 
gin. Consequently, the chlorella virus genomes contain an interesting 
mosaic of prokaryotic and eukaryotic genes. Some of the PBCV-1 gene 
products are listed in Table II. Additional comments on these genes 
can be found in other reviews (Van Etten, 2003; Van Etten and Meints, 
1999; Van Etten et al., 2002). 


C. Linearity of Gene Arrangements in the Chlorella Viruses 


To understand the fundamental organization of the chlorovirus gen- 
omes and to identify essential genes for viral replication, it will be 
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necessary to separate highly conserved regions from variable and/or 
dispensable regions. Nishida et al. (1999b) compared the gene arrange- 
ment between PBCV-1 and CVK2. Four major variations were detected: 
(1) insertion of an approximately 20-kbp sequence near the left end 
of CVK2, (2) a duplication of the gene for the major capsid protein in 
CVKa2, (3) deletions/insertions of some ORFs, and (4) a divergence in the 
terminal inverted repeat sequences. Despite these changes, colinearity 
was maintained for most of the PBCV-1 and CVK2 genes. 

The recent sequencing of three more chlorella viruses make it possi- 
ble to compare gene arrangements over their entire genomes. Figure 3 
compares the positions of 22 randomly chosen genes among four chlo- 
rella viruses. Although a few minor rearrangements occur (Section 
VII.D.2), in general, colinearity exists among the NC64A viruses 
(Fig. 3A). In contrast, there is almost no collinearity between the 
genomes of PBCV-1 and MT325, a Pbi virus (Fig. 3B). Some PBCV-1 
genes that are absent in NY-2A and AR158, such as A245R and A646L, 
are present in MT325, whereas the genes for A544R and A604L that 
are conserved in NC64A-viruses are missing in MT325. A detailed 
comparison of the gene contents and the genome architecture between 
the two major groups of chloroviruses is in progress (Fitzgerald, L. A. 
et al., manuscript in preparation). 


0 50 100 150 200 250 300 330 350 kbp 
1 #23 456 78 9 10 11 12 13 141516 47 18 1920 21 22 1 
| \l oe Al \ (> Mpa \ ALZ \ \4/ NN f oppev-1 
(330 kbp) 
A cvK2 
(350 kbp) 
41 23 456 78 9 10 11 12 13 141516 17 18 192021 22 1 
| | | \ á PBCV-1 





(330 kbp) 


MT325 
(310 kbp) 


Fic 3. Comparison of the location of 22 genes in four Chlorovirus genomes: between 
PBCV-1 and CVK2 (A), and PBCV-1 and MT325 (Pbivirus) (B). Corresponding gene 
positions are connected by lines. PBCV-1 genes used as landmarks are: (1) A3R, (2) 
A35L, (3) A50L, (4) A100R, (5) A107L, (6) A125L, (7) A181R, (8) A185R, (9) A245R, (10) 
A292L, (11) tRNA genes, (12) A357L, (13) A413L, (14) A430L, (15) A448L, (16) A464R, (17) 
A533R, (18) A575L, (19) A577L, (20) A604L, (21) A646L, and (22) A666L. Some PBCV-1 
genes are duplicated or missing in the other virus genomes. Although a few minor 
rearrangements occur (ex. a large insertion between (2) and (3) in CVK2), in general, 
colinearity exists among the NC64<A viruses (A). In contrast, there is almost no colinearity 
between the genomes of PBCV-1 and MT325 (B). Unpublished sequence data for NY-2A 
and MT325 (J. L. Van Etten and M. V. Graves) are used in the comparison. 
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D. Patterns of Gene Rearrangement 


1. Large Deletions / Insertions 


The CVK2 genome is approximately 20 kbp larger than the PBCV-1 
genome, primarily because of an extra 22.2-kbp sequence close to the 
left terminus (Fig. 3A) (Chuchird et al., 2002). This 22.2-kbp region has 
five gene copies of the Vp260-like protein, a possible viral-surface 
glycoprotein. Although none of these copies occur in the corresponding 
region in the PBCV-1 genome, four and three copies of the Vp260-like 
protein-encoding genes are also located together at almost the same 
position in the NY-2A and AR158 genomes. These rearrangements do 
not appear to be mere insertions/deletions but more complicated gene 
replacement events (Chuchird et al., 2002). 

Four PBCV-1 spontaneously derived antigenic variants were 
isolated that contain 27—37-kbp deletions at the left end of the 330- 
kbp genome (Landstein et al., 1995). Two of these mutants had dele- 
tions that began at nucleotide positions 4.9 or 16 kb and ended at 
position 42 kb. The two deletions probably resulted from recombina- 
tion at a repeated sequence. The other two mutants, which probably 
arose from nonhomologous recombination, lacked the entire left- 
terminal 37 kb of the PBCV-1 genome, including the 2.2-kbp terminal 
inverted repeats. The deleted left terminus was replaced by the trans- 
position of an inverted 7.7- or 18.5-kbp copy from the right end of the 
PBCV-1 genome. Similar 30—45-kbp deletions were also obtained with 
NC64A virus CVK1 after exposure of CVK1-infected cells to UV radia- 
tion (Songsri et al., 1995; Yamada and Higashiyama, 1993). These 
deletions also occurred in the left terminal portion of the virus genome, 
possibly by homologous recombination. These experiments indicate 
that 40—45 kbp (12-13% of the total genome) at the left end of PBCV-1 
and CVK2 genomes are unnecessary for viral replication in the 
laboratory. Similar deletion mutants occur in the poxviruses (Turner 
and Moyer, 1990) and ASFV (Blasco et al., 1989a, b); these virus 
genomes also have inverted terminal repeats and hairpin ends like 
the chloroviruses. The generation of chlorovirus deletions may be 
explained by the models proposed for deletions/transpositions in 
the poxvirus genomes (Shchelkunov and Totmenin, 1995; Turner and 
Moyer, 1990). 


2. Small Deletions / Insertions 


Small DNA deletions/insertions also often occur in chlorella virus 
genomes. Typically these events consist of one or two genes. An exam- 
ple is seen at the ORF A430L(Vp54)-A431L-432L region in the PBCV-1 
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genome. The entire A431L-like sequence is missing from this gene 
cluster in the CVK2 genome, leaving a two-gene cluster A430L-432L 
(Nishida et al., 1999b). In addition a duplicate copy of A430L (A430L’) 
is inserted in the CVK2 genome at the region corresponding to PBCV-1 
ORF A452L-A454L, resulting in a cluster of A452L-A430L'-A454L 
(Nishida et al., 1999b). In this case, short 5'GTTTT or 5’/CAAAA 
sequences are located at the rearrangement points and are assumed 
to be involved in the rearrangements. Similarly, a region surrounding 
the PBCV-1 A250R (Kcv) gene serves as an example of such rearrange- 
ments; this region was sequenced in 17 NC64A viruses. In PBCV-1 the 
genes for A251R and A252R are located immediately downstream of 
A250R; however, these genes are absent in 12 of the 17 NC64A viruses 
(Kang et al., 2004a). In addition, two of the viruses had an extra ORF 
inserted between A248R and A250R. Many other examples of small 
insertions/deletions occur in the chlorella viruses. For example, virus 
NyY-2A encodes 18 DNA methyltransferases whereas PBCV-1 has 5 
DNA methyltransferases and 7 DNA restriction/modification enzymes 
(Fitzgerald, L. A. et al., manuscript in preparation). 


3. Gene Amplification 


Eighty-four of the PBCV-1 ORFs resemble one or more other PBCV-1 
ORFs forming 26 families. Thirteen families have two members, eight 
families have three members, three families have six members, and two 
families have eight members. One six-member family contains multiple 
ankyrin-like repeats (Peters and Lux, 1993). Five members in another 
family encode proteins that resemble the PBCV-1 major capsid protein 
Vp54, although these genes do not hybridize to one another on Southern 
blots. These observations suggest some gene amplification mechanisms 
probably exist in the PBCV-1 genome. Additional examples of gene 
amplification in other chloroviruses, such as Vp260, are described 
in an earlier section. It will be interesting to identify the regions in the 
370-kbp NY-2A genome that are expanded as compared to the 330-kbp 
PBCV-1 genome. However, it is known that the region corresponding to 
A292R-A330R in PBCV-1 contains genes that are amplified extensively 
in NY-2A; A328L (two times), A154L (5 times), A354R (10 times), and 
A315L (12 times). These amplified genes are at least one reason that the 
NY-2A genome is larger than PBCV-1. 


4. Gene Replacements 


Comparison of the gene arrangements between chloroviruses has 
also revealed gene replacements. In the CVK2 genome, a single ORF 
(corresponding to PBCV-1 A330R) is replaced with a 5-kbp sequence 
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containing the genes encoding chitin synthase (chs), UDP-glucose 
dehydrogenase (ugdh), and two other ORFs (Ali et al., 2005). In 
PBCV-1, the hyaluronan synthase gene (has, A98R) is located at nu- 
cleotide positions 51-53 kbp in a cluster of A93L-A94L (-1,3-gluca- 
nase)-A98R (hyaluronan synthase)-A100R (glucosamine synthase) (Li 
et al., 1995). A similar gene cluster occurs at the corresponding region 
in the CVK2 genome; however, the two internal ORFs are replaced 
with different genes, bgl2 encoding another (-1,3-glucanase and chs2 
encoding another chitin synthase. The latter arrangement is also 
found in the NY-2A and AR158 genomes (Ali et al., 2005). 

Often these rearrangement events include a set of genes, and it is 
certainly possible that the set may have genes whose products are 
functionally related, for example, like restriction endonucleases and 
their companion DNA methyltransferases. In fact, the ORF immediate- 
ly adjacent to chs2 described previously resembles a chitin deacetylase 
(Ali et al., 2005). 

To summarize, the different sizes of the Chlorovirus genomes as well 
as their large and small deletions and insertions suggest that dynamic 
and frequent rearrangements of virus genomes occur in natural envir- 
onments. These variations probably result from several mechanisms. 
The fact that the left end of the Chlorovirus genome is tolerant to 
deletions/insertions/rearrangements suggests that a recombinational 
“hotspot(s)” in this region allows viruses to exchange genes among 
themselves and possibly with their host(s). However, despite these 
differences, the location of most PBCV-1 genes, many of which are 
probably housekeeping genes, is nearly colinear in the 330—370-kb 
NC64A viruses, suggesting similar overall genome organization 
between these Chlorovirus isolates. Given the fact that the chlorella 
viruses often encode multiple transposases and homing endonucleases, 
one might expect the virus genomes to be unstable. However, this does 
not appear to be the case, at least in the laboratory. 


VIII. CELL WALL DIGESTION: CHITINASE, CHITOSANASE, POLYSACCHARIDE 
LYASE, AND 3-1,3-GLUCANASE 


In a normal lytic cycle, PBCV-1 attaches to the surface of host 
chlorella cells and degrades the cell wall at the point of attachment; 
the viral core is then released into the host cytoplasm, leaving an 
empty capsid on the cell wall (Meints et al., 1984). Within 6-8 hpi, 
nascent viruses exit the cells after cell lysis. Thus, both the initial and 
final stages of the PBCV-1 replication cycle require cell wall-digesting 
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activities. A common characteristic of virus-sensitive chlorella strains 
is a rigid cell wall containing glucosamine in addition to other sugars; 
glucosamine comprises 7—17% of the total sugars in the cell wall 
(Kapaun and Reisser, 1995; Kapaun et al., 1992; Meints et al., 1988; 
Takeda, 1988, 1995). The presence of glucosamine suggests that 
enzymes degrading polymers of glucosamine, like chitin (G-1,4-linked 
polymer of N-acetyl-p-glucosamine) and chitosan ((3-1,4-linked poly- 
mer of D-glucosamine with various degrees of N-acetylation), might 
be involved in the viral infection process. In fact, when Chlorovirus 
CVK2 proteins are separated into the capsid and core particle fractions 
(Songsri et al., 1997; Yamada et al., 1997), and assayed by SDS-PAGE 
with chitosan or chitin as a substrate in the gel matrix (Yamada et al., 
1997), several enzymatically active proteins with molecular masses 
ranging from 35 to 70 kDa are detected in the core fraction. Of these, 
a 65-kDa protein has the most chitosanase activity and a few 50- 
60-kDa proteins have chitinase activities. Moreover, three PBCV-1 
ORFs have significant amino acid sequence identities with bacterial 
chitinases and chitosanases (Lu et al., 1996). Yamada et al. (1997) 
chracterized a chitosanase (vChta-1) gene from virus CVK2. This gene, 
which corresponds to PBCV-1 ORF A292L, encodes two functional 
chitosanase proteins with apparently different roles in virus replica- 
tion. The larger 65-kDa chitosanase is packaged in the virion and 
presumably functions during infection. In contrast, the smaller 37- 
kDa enzyme remains in the host cytoplasm, where it most likely aids 
in cell wall digestion during viral release. The enzymatic characteriza- 
tion of the PBCV-1 A292L protein has also been described (Sun et al., 
1999). 

As for the chitinase genes, a PBCV-1 ORF (A181/182R) encodes 
an active chitinase with two catalytic domains; the enzyme belongs 
to the family 18 glycosyl hydrolases (Sun et al., 1999). A A181/182R 
homolog is present in CVK2 (Hiramatsu et al., 1999, 2000). The first 
catalytic domain resembles a catalytic sequence from Saccharopoly- 
spora (Streptomyces) erythraeus (30% identity) chitinase, whereas 
the second domain resembles a chitinase from Ewingelle americana 
(34.7% identity). The two catalytic domains are linked by a short, 
proline rich sequence. This structure suggests that the two vChti-1 
chitinase domains might have independent origins (Hiramatsu et al., 
1999). A C-terminal-truncated derivative of vChti-1, containing 
the first catalytic domain, produced chitobiose from either chitote- 
traose, chitohexaose, or high-molecular mass chitin; this product 
is typical of an exochitinase. In contrast, an N-terminal—truncated 
derivative of vChti-1 produced N-acetylglucosamine from chitobiose 
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as well as chitooligosaccharides. Therefore, this domain possessed 
N-acetylglucosaminidase activity as well as endochitinase activity. 
The presence of two catalytic domains with different enzymatic proper- 
ties in the viral enzyme may allow natural substrates to be hydrolyzed 
in a cooperative fashion (Hiramatsu et al., 2000). 

The CVK2 chitinase gene (vChti-1) is expressed in virus-infected 
cells beginning at 120 min pi. However, the 94-kDa protein product is 
not incorporated into virions but remains in the medium after cell lysis 
(Hiramatsu et al., 1999). Similar results were reported for the PBCV-1 
A181R/A182R protein (Sun et al., 1999). Another PBCV-1 ORF, A260R, 
also resembles a chitinase (Sun et al., 1999). This gene is expressed 
late in infection, and the 55-kDa protein is enzymatically active and 
packaged in PBCV-1 virions. All of these chitinase and chitosanase 
genes are widely conserved in the chlorella viruses, suggesting their 
importance in viral replication. 

However, Chlorella NC64A cells do not exhibit any drastic morpho- 
logical changes when treated with vChti-1 and/or vChta-1 chitosanase 
(Hiramatsu et al., 1999; Yamada et al., 1999). Therefore, additional 
virus encoded enzymes are probably involved in host cell wall diges- 
tion. To detect such cell wall-degrading activities, Æ. coli lysates 
expressing virus CVK2 genes were assayed by monitoring halo- 
forming activity using Chlorella NC64A cells as a substrate. These 
expreriments revealed two algal-lytic activities (vAL-1 and vAL-2) 
(Sugimoto et al., 2000). Val-1 transcription and translation products 
appear at 60 and 90 min pi, respectively. The vAL-1 protein is not 
incorporated into the viral particles but remains in the cell lysate, 
suggesting it is involved in cell wall digestion during viral release. 
Chlorella NC64A cell walls are digested by vAL-1 under physiological 
conditions. TLC and MALDI-TOF mass spectrometric analyses of the 
degradation products (oligosaccharides) revealed that the major oligo- 
saccharides had unsaturated p-glucuronic acid (GlcA) (C4 = C5) at 
the reducing terminus, and a side chain attached at C2 or C3 of GlcA 
(C4 = C5). The side chain consisted primarily of Ara, GlcNAc, and Gal. 
These results indicate that vAL-1 is a novel polysaccharide lyase, 
cleaving chains of either 8- or a-1,4-linked GlcAs (Sugimoto et al., 
2004). In addition to Chlorella NC64A, vAL-1 lysed cells of four 
C. vulgaris isolates as well as Chlorella SAG-241—80 (Chuchird et al., 
2001; Sugimoto et al., 2000). A PBCV-1 ORF (A94Z) encodes a protein 
with 26-30% amino acid identity with family 16 endo-Z-1,3-glucanases 
from several bacteria. A94L also has the critical amino acids in the 
catalytic site of family 16 endo-G-1,3- and endo-G-1,3—1,4-glucanases. 
The A94L recombinant protein hydrolyzed the G-1,3-glucan laminarin 
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and had slightly less hydrolytic activity on G-1,3-1,4-glucan lichenan 
and barley 3-glucan (Sun et al., 2000). The a94l gene is expressed early 
in PBCV-1 infection. Furthermore, the A94R protein appears 
early during PBCV-1 replication but then disappears by 120 min pi. 
The biological function of this G-1,3-glucanase is unknown as 
approximately 50% of chlorella viruses lack this gene. However, some 
chlorella viruses lacking A94L homologs contain another gene encod- 
ing an endo-/-1,3-glucanase (bgl2) (Ali et al., 2005). The functions of 
these genes and their products are unknown. 

The chlorella viruses may be a good source of polysaccharide digest- 
ing enzymes because cell wall polysaccharides can differ significantly 
among chlorella isolates (Yamada and Sakaguchi, 1982). Thus one 
predicts that some virual enzymes may have some unique activities. 

At the same time, the origin and evolution of the genes encoding the 
polysaccharide-degrading enzymes in the chloroviruses are intriguing. 
A gene encoding a chitinase, that shares significant amino acid 
sequence identity with vChti-1, was discovered in host Chlorella 
NC64A (Ali, A. M. et al., manuscipt in preparation). Comparison of 
the structure, nature and function of this gene product with vChti-1 
may reveal information on the origin and evolution of the viral genes. 


IX. GLYCOSYLTRANSFERASES AND SYNTHESIS OF POLYSACCHARIDES 
AND FUCOSE 


A. Glycosylation of the Virus Structural Proteins 


Virus PBCV-1 contains six possible glycosyltransferase encoding 
genes, a64r, a111 / 114r, a222 / 226r, a3281, a473l, and a546l (Van Etten, 
2003; Van Etten et al., 2002). None of these putative glycosyltrans- 
ferases have an identifiable signal peptide that would target them to 
the endoplasmic reticulum (ER) or Golgi. Furthermore, the cellular 
protein localization program PSORT predicts that four of these pro- 
teins are located in the cytoplasm and two have a transmembrane 
domain (Van Etten, 2003). The a64r gene encodes a 638 amino acid 
protein that has four motifs conserved in “Fringe type” glycosyltrans- 
ferases. Analysis of 13 PBCV-1 antigenic variants, with differences in 
the major capsid protein Vp54 glycans, have mutations in a64r that 
correlated with specific antigenic variations. Dual infection experi- 
ments with different antigenic variants established that wild-type 
PBCV-1 could be formed by complementation and recombination of 
the variants. These results led to the conclusion that a64r encodes a 
glycosyltransferase associated with Vp54 glycan synthesis (Graves 
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et al., 2001). Typically, viral proteins are glycosylated by host-encoded 
glycosyltransferases located in the ER and Golgi (Doms et al., 1993; 
Knipe, 1996; Olofsson and Hansen, 1998). Consequently, the glycan 
portion of virus glycoproteins is host specific. Therefore, glycosylation 
of PBCV-1 major capsid proteinVp54 differs from this paradigm. Addi- 
tional experiments to support this statement are provided in a review 
(Markine-Goriaynoff et al., 2004). 

These findings lead to several questions including: are the Vp54 
glycan precursors attached to a lipid carrier such as undecaprenol- 
phosphate which serves as an intermediate in bacterial peptidoglycan 
synthesis (Raetz and Whitfield, 2002) or dolichol diphosphate which 
serves the same function in eukaryotic cells (Reuter and Gabius, 
1999)? Could Vp54 glycosylation reflect an ancestral pathway that 
existed prior to the ER and Golgi formation? 


B. Extracellular Polysaccharide Synthesis 


The chloroviruses are also unusual because they encode enzymes 
involved in the biosynthesis of the linear polysaccharides hyaluronan 
(also called hyaluronic acid) and/or chitin. Typically, hyaluronan is 
only found in the extracellular matrix of vertebrates and capsules of 
a few pathogenic bacteria (DeAngelis, 1999, 2002). It is composed of 
~20,000 alternating (G-1,4-glucuronic acid and (-1,3-N-acetylglucosa- 
mine residues (DeAngelis, 1999). Unexpectedly, virus PBCV-1 contains 
genes encoding hyaluronan synthase (HAS) (DeAngelis et al., 1997) 
and two other enzymes involved in the synthesis of hyaluronan pre- 
cursors, glutamine:fructose-6-phosphate amidotransferase and UDP- 
glucose dehydrogenase (Landstein et al., 1998). All three genes are 
expressed early during PBCV-1 infection. These results led to the 
discovery that hyaluronan lyase-sensitive, hair-like fibers begin to 
accumulate on the surface of PBCV-1 infected host cells by 15 min pi 
(Graves et al., 1999). By 4 hpi, the infected cells are covered with a 
dense fibrous hyaluronan network. 

The has gene is present in many, but not all, chloroviruses isolated 
from diverse geographical regions (Graves et al., 1999), suggesting 
that not all chloroviruses encode hyaluronan. Surprisingly, many 
chloroviruses that lack a has gene, have a gene encoding a functional 
chitin synthase (CHS). Furthermore, cells infected with these viruses 
produce chitin fibers on their external surface (Kawasaki et al., 2002). 
Chitin, an insoluble linear homopolymer of (-1,4-linked N-acetyl- 
glucosamine residues, is a common component of insect exoskeletons, 
shells of crustaceans, and fungal cell walls (Gooday et al., 1986). 
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Some chloroviruses contain both has and chs genes, and form both 
hyaluronan and chitin on the surface of the infected cells (Kawasaki 
et al., 2002; Yamada and Kawasaki, 2005). Finally, a few chloroviruses 
probably lack both genes because no extracellular polysaccharides are 
formed on the host surface of cells infected with these viruses (Graves 
et al., 1999). 

The fact that many chloroviruses encode enzymes involved in extracel- 
lular polysaccharide biosynthesis suggests that the polysaccharides are 
important in the virus life cycle. At present this function is unknown; 
however, we have considered three possibilities. (1) The polysaccharides 
prevent uptake of virus-infected chlorella by the paramecium. Presum- 
ably, such infected algae would lyse inside the paramecium, and the 
released virions would be digested by the protozoan. This scenario would 
be detrimental to virus survival. (2) The viruses have another host 
that acquires the virus by taking up the polysaccharide-covered algae. 
(3) Virus-infected cells aggregate, presumably due to the extracellular 
polysaccharide. This aggregation, which could trap uninfected cells, 
might aid the virus in finding its next host. However, a complicating 
factor in understanding the biological role of these two polysaccharides 
is that some viruses apparently lack both genes. 

Ali et al. (2005) investigated the genetic relationship between “hya- 
luronan-synthesizing” and “chitin-synthesizing” viruses by character- 
izing two genomic regions in the chitin-synthesizing virus CVK2 and 
comparing them to PBCV-1. One region surrounds the CVK2 chs 
region and the other corresponds to the region containing PBCV-1 
has. A single PBCV-1 ORF (A330R) is replaced in the CVK2 genome 
with a 5-kb region containing chs, ugdh2 (a second gene encoding 
UDP-glucose dehydrogenase) and two other ORFs. In CVK2 the loca- 
tion of the PBCV-1 has gene is replaced with another chs gene (de- 
scribed in Section VII.C.4). Some chloroviruses lack ugdh. These 
results suggest that chloroviruses change from “has viruses” to “chs 
viruses” or from “chs viruses” to “has viruses” by exchanging genes 
(Fig. 4). 

These observations also indicate that there is no functional incom- 
patibility between the two genes or their gene products, that is, hya- 
luronan and chitin. These conclusions are interesting because it has 
been suggested that the has gene in vertebrates evolved from chitin 
synthase or cellulose synthase through the addition of a 8-1-3 glyco- 
syltransferase activity to a preexisting (3-1-4 glycosyltransferase 
enzyme and that the ability to synthesize hyaluronan occurred rela- 
tively recently in metazoan evolution (Lee and Spicer, 2000). Another 
chs-like gene was reported to be encoded by the 336-kbp Ectocarpus 
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Fic 4. A diagram illustrating the genetic differences between HA and/or chitin 
synthesizing chloroviruses (Ali et al., 2005). Line 1 shows the location of three PBCV-1 
genes involved in the synthesis of HA. Viruses CVIK1 and CVHA1 that synthesize both 
HA and chitin (Line 2) have a chitin synthesizing gene inserted in the position of the 
PBCV-1 A330R ORF. Viruses that only synthesize chitin have two chs genes. These 
viruses may still have a ugdh gene (viruses CVK2, NY-2A, and AR158 in Line 3) or lose 
it (virus CVSA1 in Line 4). It is equally likely that the events depicted could occur in 
reverse order, that is, from Line 4 to Line 1. 


siliculosus virus EsV-1 (Delaroque et al., 2001). EsV-1 also belongs to 
the Phycodnaviridae but is lysogenic, in contrast to the lytic chlorella 
viruses. No information is available on the expression or function of 
the EsV-1 chs-like gene. 

The discovery of viral encoded enzymes involved in polysaccharide 
synthesis leads to many questions such as: (1) what is the function of 
these extracellular polysaccharides? (2) Why do infected cells expend 
huge amounts of energy on these processes when they are going to lyse 
in a few hours? (3) How were these genes acquired by the viruses? 
(4) Which was the original form of viral-encoded polysaccharides, 
hyaluronan or chitin? (5) How are hyaluronan and chitin polymers 
synthesized in the host cells and translocated through the cell wall? 


C. Fucose Synthesis 


PBCV-1 also encodes enzymes involved in nucleotide sugar metabo- 
lism. Two enzymes encoded by PBCV-1, GDP-p-mannose 4,6 dehydratase 
(GMD) and the bifunctional GDP-4-keto-6-deoxy-D-mannose epimerase/ 
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reductase (GMER) comprise the highly conserved pathway that 
converts GDP-p-mannose to GDP-L-fucose. In vitro reconstruction 
of the biosynthetic pathway using recombinant PBCV-1 GMD and 
GMER produced in E. coli synthesized GDP-L-fucose (Tonetti et al., 
2003). Unexpectedly, however, the PBCV-1 GMD, which has been 
crystallized recently (Rosano et al., 2005), also catalyzes the NADPH- 
dependent reduction of the intermediate GDP-4-keto-6-deoxy-D-man- 
nose, forming GDP-p-rhamnose. Similar results were obtained with 
GMD and GMER encoded by virus CVK2 (Isono, K. et al., unpublished 
data). Both fucose and rhamnose are constituents of the glycans 
attached to the PBCV-1 major capsid protein Vp54; therefore, the 
virus might encode the enzymes to meet this need. However, both 
sugars are also components of the uninfected host cell wall (Meints 
et al., 1988). 


X. RELATIONSHIP OF CHLOROVIRUSES TO OTHER VIRUSES 


As mentioned in the introduction, the phycodnaviruses are members 
of the superfamily of viruses referred to as NCLDVs and that accumu- 
lating evidence indicates that the phycodnaviruses have a long evolu- 
tionary history, possibly beginning at the time eukaryotes separated 
from prokaryotes (over 3 billion years ago). The evidence includes: (1) 
Phylogenetic analyses of DNA polymerases place the algal virus 
enzymes near the root of all eukaryotic 6-DNA polymerases (Villarreal 
and DeFilippis, 2000). (2) Phylogenetic analyses of other PBCV-1 
encoded proteins often place the proteins near the root of their eukary- 
otic counterparts, for example, the K* channel protein Kev (Plugge 
et al., 2000) and ornithine decarboxylase (Shah et al., 2004). (3) Many 
Chlorovirus encoded proteins are either the smallest or among the 
smallest proteins in their class. Examples include the type II DNA 
topoisomerase (Dickey et al., 2005), Kev (Plugge et al., 2000), ornithine 
decarboxylase (Shah et al., 2004), and lysine di-methyltransferase 
(Manzur et al., 2003). These proteins could represent progenitors of 
their more complex relatives. (4) Some PBCV-1-encoded enzymes are 
more flexible than those from higher eukaryotic organisms. For exam- 
ple, some virus enzymes carry out two functions whereas more 
“advanced” organisms require two separate enzymes to accomplish 
the same tasks. One interpretation of this observation is that these 
virus proteins may be progenitor enzymes and thus more precocious 
than their highly evolved counterparts in higher eukaryotes, where 
two separate enzymes carry out the function of one virus enzyme. This 
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dual functionality in the PBCV-1 enzymes does not result from gene 
fusion. Examples include: (a) ornithine decarboxylase which decarbox- 
ylates arginine more efficiently than ornithine (Shah et al., 2004). (b) 
dCMP deaminase which also deaminates dCTP and dCDP, as well as the 
expected dCMP (Zhang, Y. et al., manuscript in preparation). (c) GDP-p- 
mannose 4,6 dehydratase not only catalyzes the formation of GDP-4- 
keto-6-deoxy-D-mannose, which is an intermediate in the synthesis of 
GDP-.-fucose, the enzyme also reduces the same intermediate to GDP- 
p-rhamnose (Tonetti et al., 2003). (5) Even though PBCV-1 encodes 
both prokaryotic- and eukaryotic-like proteins, the 40% G + C content 
is fairly uniform throughout the genome. This pattern suggests 
that most of the genes have existed together in the virus for a long time. 
(6) The major coat protein from several dsDNA viruses that infect 
all three domains of life, including bacteriophage PRD1, human adeno- 
viruses, and a virus STIV infecting the Archaea, Sulfolobus solfatari- 
cus, are structurally similar to that of PBCV-1. This finding led Benson 
et al. (2004) to suggest that all these viruses may have a common 
evolutionary ancestor, even though there is no significant amino 
acid sequence similarity among their proteins. (7) Finally, one of the 
earliest eukaryotic cells could have resembled a single celled alga (Yoon 
et al., 2004). 

Some evolutionary biologists have suggested that NCLDVs may be 
the origin of the nucleus in eukaryotic cells (Bell, 2001; Pennisi, 2004; 
Villarreal, 2005), whereas other biologists (Raoult et al., 2004) have 
suggested that Mimivirus, and by inference the NCLDVs, may repre- 
sent a fourth domain of life. This later hypothesis resulted from a 
phylogenetic tree of life derived from the concatenated sequences 
of seven universally conserved proteins sequences: arginyl-tRNA syn- 
thetase, methionyl-tRNA synthetase, tyrosyl-tRNA synthetase, RNA 
polymerase II largest subunit, RNA polymerase II second largest sub- 
unit, PCNA, and 5’—3’ exonuclease. Mimivirus formed a branch near 
the origin of the Eukaryotic domain. 

Continuing with this hypothesis, perhaps the NCLDVs originally 
arose from a more complex ancestor and this progenitor ancestor 
(virus) became associated with different organisms that evolved into 
various eukaryotic lineages. These lineages placed various demands 
on the evolving NCLDVs resulting in the loss of some genes, either by 
donating genes to their hosts or the genes were no longer required for 
replication of the evolving viruses. The net result is that the NCLDVs 
now only have nine common genes. Of course over 3 billion years of 
evolution the viruses also acquired selected genes from their hosts that 
were necessary for survival. 
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Eight of the 10 viruses with the largest sequenced genomes 
are NCLDVs and 6 of them are members of the Phycodnaviridae 
(http://giantvirus.org). The largest algal virus genome sequenced to 
date infects Emiliania huxleyi and has a genome of 407 kb that con- 
tains ~470 protein-encoding genes (Wilson et al., 2005a). However, 
larger algal virus genomes exist; examples include lytic viruses 
specific for Phaeocystis pouchetii (PpV) (Jacobsen et al., 1996), Chry- 
sochromulina ericina (CeV-01B) (Sandaa et al., 2001), and Pyramimo- 
nas orientalis (PoV-01B) (Sandaa et al., 2001), which have genomes of 
approximately 485, 510, and 560 kb, respectively. To put the size 
of these viral genomes into perspective, the smallest bacterium, Myco- 
plasma genitalium has a genome of 580 kb that contains ~470 protein- 
encoding genes (Fraser et al., 1995) and the smallest Archaea, 
Nanoarchaeum equitans, has a 490-kb genome that contains ~550 
protein-encoding genes (Waters et al., 2003). Estimates of the mini- 
mum genome size required to support life are ~250 protein-encoding 
genes (e.g., Itaya, 1995; Mushegian and Koonin, 1996). The Mimivirus 
genome is actually larger than 25 microbial genomes currently in the 
databases (http://giantvirus.org). Thus a big question is: what differ- 
entiates large complex DNA viruses from small obligate intracellular 
microorganisms? It is also obvious that metagenomic sequencing pro- 
jects are identifying many more genes associated with large dsDNA 
viruses (Breibart et al., 2002; Tyson et al., 2004; Venter et al., 2004). 
Without doubt many of these will be algal infecting viruses (Suttle, 
2005). 

The presumed long evolutionary history of the NCLDVs can also 
explain why the phycodnaviruses are so diverse. Of the six genera in 
the family Phycodnaviridae, members of two genera in addition to the 
chlorella viruses have been sequenced, the circular Emiliania huxleyi 
virus (EhV-1) 407-kb genome (Wilson et al., 2005a) and the linear 
Ectocarpus siliculosus virus (EsV-1) 335-kb genome (Delaroque et al., 
2001). Including PBCV-1, each of these three viruses codes for several 
hundred proteins (Dunigan et al., 2006). However, it is truly amazing 
that only 14 protein-encoding genes are common to all three viruses. 
This means that just these three viruses contain more than a 1000 
unique protein-encoding genes. Despite the large genetic diversity in 
these three sequenced phycodnaviruses, phylogenetic analyses of their 
6-DNA polymerases (Chen and Suttle, 1996; Wilson et al., 2005a) anda 
superfamily of archeao-eukaryotic primases (Iyer et al., 2005) indicate 
that the phycodnaviruses fall into a monophyletic clade within the 
NCLDVs. 
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The long evolutionary history can also explain the diversity among 
viruses that infect Chlorella NC64A or Chlorella Phi. The amino acid 
sequence of protein homologs between Chlorella NC64A infecting 
viruses can differ by as much as 25%. This difference can be ~50% 
between homologs from NC64A and Pbi viruses. This diversity can be 
used to identify amino acid substitutions that alter the functional prop- 
erties of proteins, for example as has been done with the chlorovirus 
encoded K* channels (Gazzarrini et al., 2004; Kang et al., 2004b). 

In addition to the genomic sequence of PBCV-1, sequences of several 
other chlorella virus geneomes are now available, and they make it 
possible to determine a pan-genome of chlorella virus, which consists 
of a core genome shared by all isolates and a dispensable genome 
consisting of partially shared and isolate-specific genes. The core 
genes, which are highly conserved among different viral isolates, re- 
flect their importance in the basic infection cycle. The variable genes 
are related to recent growth history and the range of infection of 
individual virus isolates. One outcome of this diversity is that the total 
number of Chlorovirus protein-encoding genes is much larger than 
that from any one virus. 


XI. PERSPECTIVES 


It has been ~25 years since the first chloroviruses were described. 
Sequencing the 330-kb PBCV-1 genome about 10 years ago revealed, 
and is continuing to reveal, many unexpected genes such as genes 
encoding proteins involved in polysaccharide biosynthesis, polyamine 
biosynthesis, and ion transport proteins. The discovery of these genes, 
and the fact that many of their gene products are easy to work with, 
has led to an expansion in the number of investigators who are study- 
ing chloroviruses and their genes. However, many fundamental events 
in chlorovirus replication have only begun to be studied as indicated 
by some of the questions posed in this review. The biggest handicap 
facing studies on the chloroviruses, as well as all of the phycodna- 
viruses, is the lack of a functional molecular genetic system. However, 
new tools are coming into play with the chlorella viruses. For example, 
microarrays containing all of the putative PBCV-1 protein-encoding 
genes are now being used to study PBCV-1 gene expression, protei- 
nomic studies are identifying all of the virion-encoded proteins pack- 
aged in the virion, and the USA Department of Energy has agreed to 
sequence the PBCV-1 host Chlorella NC64A. These new tools should 
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lead to a greater understanding of virus-host relationships in the near 
future. 
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ABSTRACT 


The ability to regulate cellular gene expression is a key aspect of the 
lifecycles of a diverse array of viruses. In fact, viral infection often 
results in a global shutoff of host cellular gene expression; such inhibi- 
tion serves not only to ensure maximal viral gene expression without 
competition from the host for essential machinery and substrates but 
also aids in evasion of immune responses detrimental to successful 
viral replication and dissemination. Within the herpesvirus family, 
host shutoff is a prominent feature of both the a- and y-herpesviruses. 
Intriguingly, while both classes of herpesviruses block cellular gene 
expression by inducing decay of messenger RNAs, the viral factors 
responsible for this phenotype as well as the mechanisms by which it 
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is achieved are quite distinct. However, data suggest that the host 
shutoff functions of a- and y-herpesviruses are likely achieved both 
through the activity of virally encoded nucleases as well as via modula- 
tion of cellular RNA degradation pathways. This review highlights the 
processes governing normal cellular messenger RNA decay and then 
details the mechanisms by which herpesviruses promote accelerated 
RNA turnover. Parallels between the viral and cellular degradation 
systems as well as the known interactions between viral host shutoff 
factors and the cellular RNA turnover machinery are highlighted. 


I. INTRODUCTION 


In eukaryotic gene expression, many factors influence the levels of 
individual mRNAs creating a controlled environment that permits 
continuous expression of certain transcripts while allowing the expres- 
sion of others to occur only at specified times. While discussion of this 
regulation once focused exclusively on transcriptional regulation, the 
important role of posttranscriptional controls in the regulation of gene 
expression is now widely accepted. A central pillar in the edifice of 
posttranscriptional regulation is the control of mRNA stability, which 
is governed by a complex and sophisticated machinery that acts via 
several different biochemical pathways. Given the importance of this 
regulation for cells to function appropriately under “normal” and 
stressful conditions, it is not surprising that several viruses have 
evolved means to modulate this cellular regulatory system to facilitate 
their own replication and immune evasion strategies. 

The first portion of this review will focus on our current understand- 
ing of regulated mRNA turnover in mammalian cells. We will then 
detail how various members of the herpesvirus family promote wide- 
spread turnover of cellular mRNA, both via the action of virally en- 
coded ribonucleases and through the presumed deregulation of cellular 
mRNA turnover pathways. 


II. REGULATION OF CYTOPLASMIC MRNA TURNOVER IN MAMMALIAN CELLS 


A. Overview 


The lion’s share of our current knowledge of eukaryotic mRNA 
turnover comes from extensive research in yeast, although the last 
several years have shown a significant increase in the identification 
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and characterization of the parallel pathways in mammalian cells. 
Eukaryotic mRNA molecules are protected at their 5’ end by the 
presence of a m’GpppN cap and at their 3’ end by a poly(A) tail. Each 
of these structures is in turn bound by several cellular proteins that 
contribute to mRNA stability and translation; a cap-binding complex 
includes the cap-associated eIF4E, which in turn associates with the 
translation initiation factor eIF4G; the poly(A) tail is protected by the 
poly A binding protein (PABP). PABP also interacts with eIF4G, effec- 
tively circularizing the mRNA and further protecting the ends from 
exonucleolytic attack (Munroe and Jacobson, 1990; Wells et al., 1998). 
Events that disrupt these stabilizing interactions, such as (1) transla- 
tion or (2) recruitment of specific RNA binding proteins to messages 
with instability elements (see later) trigger degradation of the mRNAs. 
Thus, protein complexes engaged by the mRNA termini are essential 
for the control of message fate in cells. 

In yeast, normal cellular mRNA decay initiates with the removal of 
the poly(A) tail by deadenylation enzymes in what is often the rate- 
limiting step of the turnover (Meyer et al., 2004; Parker and Song, 
2004; Wilusz et al., 2001). Cellular decapping enzymes (including 
Dep1/Dcp2 and the Lsm complex) then remove the 5’ m’GpppN cap, 
thereby permitting access of the 5’ end of the RNA to the Xrn1p 5/—3’ 
ribonuclease that degrades the body of the message. In addition, yeast 
also possess a secondary pathway that promotes degradation in the 
3'—5' direction by the yeast exosome, an enzymatic complex that in- 
cludes numerous 3’—5’ RNases, as well as additional proteins involved 
in RNA binding and helicase activity (Raijmakers et al., 2004). Many 
of these RNA degradation components have been shown to localize 
within discreet cytoplasmic foci termed P bodies or cytoplasmic bodies, 
suggesting that these may represent sites of mRNA turnover in cells 
(Sheth and Parker, 2003). 

Turnover of mRNA in mammalian cells appears to proceed similarly 
to yeast, in that it involves deadenylation, decapping, and exonucleo- 
lytic decay (Fig. 1). Many of the mammalian mRNA degradation com- 
ponents were identified based on homology to their yeast counterparts. 
However, a key difference between mammalian and yeast turnover lies 
in the relative importance of the 5’—3’ and 3'—5' exonucleolytic decay 
pathways; in mammalian cells, the exosome-mediated 3’—5’ pathway 
appears to predominate. However, the majority of evidence supporting 
this view has been acquired via in vitro studies, and it is not unlikely 
that this view may undergo revision in the future. For example, there 
does exist a mammalian homolog of the 5’/—3’ Xrn1 RNase, a strong 
indication that this pathway likely plays some role in mammalian 
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Fic 1. Pathways of mammalian mRNA turnover. The 3’ and 5’ ends of mRNAs are 
normally protected via their interactions with PABP and the cap-binding complex. 
However, events that disrupt these associations lead to exposure of the ends to cellular 
exonucleases. The first step of mammalian mRNA turnover involves deadenylation of the 
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systems as well (Bashkirov et al., 1997; Meyer et al., 2004). Nearly all 
of these factors have roles in both the nucleus and the cytoplasm, 
although the mRNA turnover function correlates primarily with their 
cytoplasmic localization. Several mRNA degradation enzymes, as well 
as the cap binding protein eIF4E, have been shown to colocalize with 
the RNA binding protein GW182 in cytoplasmic bodies (also called P 
bodies, GW bodies); it is postulated that these sites represent domains 
of posttranslational mRNA decay (Andrei et al., 2005; Cougot et al., 
2004; Eystathioy et al., 2002, 2003; Ingelfinger et al., 2002; van Dijk 
et al., 2002). As the majority of regulated cellular mRNA turnover 
occurs in the cytoplasm of cells, this review will focus on the cytoplas- 
mic pathways and their control. The following sections will describe 
the current state of knowledge regarding each stage of regulated 
mammalian mRNA turnover. 


B. Deadenylation 


Shyu et al. (1991) used transcriptional pulse-chase experiments to 
show that deadenylation of the c-fos mRNA occurred prior to degrada- 
tion of the body of the message. Couttet et al. (1997) subsequently 
measured a variety of liver mRNA poly(A) tail lengths using an RT- 
PCR approach and confirmed that shortening of mammalian mRNA 
poly(A) tails from 100 to 300 nucleotides (nt) to approximately 30—60 nt 
precedes decapping and decay of the remainder of the message. It is 
now established that deadenylation represents the first step of cyto- 
plasmic mRNA turnover. Several deadenylases have been identified in 
mammals: poly(A)-specific exoribonuclease (PARN, previously called 
DAN), the CCR4-NOT complex, the poly(A) nuclease complex (hPan2/ 
Pan3), and nocturnin. 

PARN, the best characterized deadenylase in mammalian cells, was 
originally identified as a deadenylating activity in HeLa cell extracts 
(Astrom et al., 1991) and was subsequently purified to homogeneity 
from calf thymus (Korner and Wahle, 1997; Martinez et al., 2000). 
PARN exists as a homo-oligomer (perhaps trimer) that specifically 
degrades poly(A) located at the 3’ end of messages in a processive 


caja 
poly(A) tail by deadenylases such as PARN or hPan. The majority of deadenylated RNAs 
then undergo 3’—5’ degradation by the exosome, followed by cap removal by the scaven- 
ger decapping enzyme DcpS. Alternatively, rather than undergoing exosomal decay, 
some deadenylated RNAs are first decapped by the Dcp1/2 decapping complex (assisted 
by the Lsm proteins; not shown) then degraded in a 5'—3' manner by the Xrn1 nuclease. 
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manner is dependent on divalent cations (preferably Mg”*) and is 
inhibited by the presence of PABP on the poly(A) tail (Ford et al., 
1997; Martinez et al., 2000). The human PARN cDNA was shown to 
exhibit homology to RNase D-like enzymes of the DEDD superfamily 
family of 3’ RNA and DNA exonucleases, and site-directed mutational 
analyses confirmed that four conserved amino acids in PARN aligning 
with the RNase D motifs involved in catalysis (Asp-28, Glu-30, 
Asp-292, Asp-382) are essential for PARN activity via binding of diva- 
lent metal ions (Korner et al., 1998; Ren et al., 2002, 2004). Western 
blots and immunofluorescence assays demonstrate that, in HeLa 
cells, PARN is a predominantly cytoplasmic protein of approximately 
75 kDa. PARN appears to have homologs in a diverse range of organ- 
isms including plants and Caenorhabditis elegans but not in yeast. The 
development of a HeLa $100-based in vitro degradation system that 
faithfully reproduced regulated mRNA turnover (Ford and Wilusz, 
1999) facilitated subsequent analyses of the mechanism of PARN ac- 
tivity at the molecular level. The observation that immunodepletion of 
PARN from HeLa extracts greatly reduced deadenylation of substrate 
RNAs suggested that PARN represented the major deadenylase, at 
least in vitro (Gao et al., 2000). Significantly, in HeLa and Xenopus 
oocyte extracts, PARN also associates with the 5’ cap on mRNAs in a 
manner that is blocked by eIF4E and stimulated by the poly(A) tail, 
and this interaction enhances its deadenylation activity (Dehlin et al., 
2000; Gao et al., 2000). Thus, PARN appears to interact simultaneous- 
ly with both the 5’ and 3’ ends of mRNA. Martinez et al. (2001) 
demonstrated that PARN activity is stimulated by cap structures 
provided in cis (attached to the 5’ end of the mRNA) or in trans at 
low concentrations, but it is inhibited by high concentrations of cap 
analog added in trans (Martinez et al., 2001). They propose a model 
whereby trimeric PARN associates with the poly(A) tail and one sub- 
unit also binds the 5’ cap; following that interaction, a conformational 
change in that subunit which would serve to activate the remaining 
two subunits and enhance the processivity of the enzyme. Low concen- 
trations of cap analog could stimulate the enzyme by helping recruit 
PARN to uncapped messages, whereas high concentrations might in- 
hibit it by either inactivating or blocking necessary interactions of the 
other subunits (Martinez et al., 2001). 

Collectively, these observations suggest that the 5’ and 3’ terminal 
interactions of a translatable mRNA protect the message from PARN- 
induced deadenylation; however, disruption of these associations, for 
example, during the process of ribosome scanning and translation, 
permits PARN access to the message. Additionally, PARN-mediated 
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deadenylation can occur in a translation-independent manner during 
decay of mRNAs containing destabilizing elements in their coding 
region or 3’ untranslated regions (UTRs), as discussed in a subsequent 
section of this review. It is important to note that the contribution of 
PARN to in vivo mRNA turnover has yet to be confirmed in mammalian 
cells. However, Seal et al. (2005) have shown that serum deprivation of 
human liver Hep G2 cells results in enhanced mRNA turnover that 
correlates with a decrease in cap-associated eIF4E and an increase in 
cap binding by PARN. Furthermore, they demonstrate that under 
these conditions PARN phosphorylation is increased whereas phos- 
phorylation of the cellular cap binding proteins was diminished; thus, 
PARN-mediated deadenylation in mammalian cells may be regulated, 
at least in part, by this posttranslational modification. 

The major deadenylase in yeast is the CCR4-NOT complex, a group of 
proteins including Ccr4p, Pop2p/Caflp, Caf40p, Caf130p, and Not1-5p 
that were originally identified as transcriptional repressors (Chen 
et al., 2001b; Collart, 2003). The Ccr4p component of the complex 
exhibits homology to Escherichia coli exonuclease III and the Pop2p/ 
Caflp subunit is a member of the RNase D-like DEDD family of 
nucleases. Yeast deletion mutants lacking Ccr4p, Pop2/Caflp, or 
Not2, -3, -4, or -5 exhibit a deadenylation defect phenotype, suggesting 
that these are all important for the in vivo function of the complex 
(Meyer et al., 2004). This deadenylase appears to be inhibited by Pab1p 
in yeast, similar to mammalian PARN (Tucker et al., 2002). Human 
homologs of the CCR4-NOT complex have been identified and appear 
to be ubiquitously expressed in human tissues (Albert et al., 2000). The 
ability of hNot3 and hNot4 to partially complement yeast Not3/4 mu- 
tants suggests that these enzymes are functionally conserved. Fur- 
thermore, recombinant human Cafl and Pop2 purified from E. coli 
exhibit in vitro deadenylase activity and have been shown to interact 
with Cer4 family members (Bianchin et al., 2005; Dupressoir et al., 2001). 
Significantly, experiments using small interfering RNAs (siRNAs) to 
knock down Ccr4 in NIH 3T3 cells have shown that in the absence of 
this factor deadenylation of c-fos reporter mRNAs in cells occurs with 
reduced rate (Yamashita et al., 2005). Thus, the available data are con- 
sistent with the idea that these deadenylases retain some functional 
similarities to their yeast counterparts and additionally may play a 
prominent role in mammalian mRNA turnover. 

The human cytoplasmic poly(A) nuclease complex (hPan) was 
identified based on its homology to the yeast Pan deadenylase 
(Uchida et al., 2004). As in yeast, hPan is composed of two proteins, 
the hPan2 catalytic subunit and the hPan3 regulatory subunit. hPan2 
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is an Mg”*-dependent 3'—5' distributive exonuclease with homology to 
RNase D; mutation of a conserved residue in the hPan2 RNase D 
catalytic domain abrogated its nuclease activity in vitro. In contrast 
to the other eukaryotic deadenylases, in yeast Pan activity is stimu- 
lated rather than repressed by Pablp (Sachs and Deardorff, 1992). 
This is believed to play a role in its putative function in the shortening 
or trimming of poly(A) tails just after their synthesis (Brown et al., 
1996). In accord with its activity in yeast, hPan3 has been shown to 
directly bind PABP in a manner that stimulates the activity of the hPan 
enzyme. Substrates from which PABP had been immunodepleted were 
degraded with significantly reduced efficiency, although they were rap- 
idly deadenylated upon addition of recombinant PABP in an hPan3- 
dependent manner (Uchida et al., 2004). The fact that Pan activity is 
enhanced by the presence of PABP and does not require the presence of 
a5’ cap suggests that this enzyme may be able to promote deadenylation 
during or prior to translation (Uchida et al., 2004). 

The most recently identified mammalian deadenylase is nocturnin, a 
Ccr4 homolog located in the photoreceptor cells of Xenopus laevis, where 
it is specifically expressed at night and contributes to the regulation of 
circadian rhythm (Baggs and Green, 2003; Chen et al., 2002; Green and 
Besharse, 1996). As with PARN, no nocturnin orthologs have been found 
in yeast, although murine nocturnin was shown to be expressed in a 
circadian pattern in numerous mouse tissues (Wang et al., 2001). 

Why would a cell require multiple independent enzymes to perform a 
single function? Although the answer to this question is not really known, 
it is postulated that the multitude of deadenylases allows for multiple 
tiers of regulation. Since deadenylation is the first and likely rate- 
limiting step of turnover, it makes sense that the cell would impose layers 
of redundancy to ensure timely and accurate deadenylase activity and 
fine-tuning of its regulation. 


C. 3’-5' Decay by the Exosome 


After deadenylation, the bulk of the mRNA body is degraded in a 3’—5’ 
manner by the mammalian exosome. The exosome is a multienzyme 
complex present in all eukaryotes that has both nuclear and cytoplasmic 
functions. While the nuclear exosome functions in processing rRNAs, 
snRNAs, and snoRNAs as well as digests improperly processed pre- 
mRNAs (Butler, 2002), the cytoplasmic exosome is involved in mRNA 
turnover (Raijmakers et al., 2004). In agreement with these functions, 
the exosome is localized to the cytoplasm, nucleoplasm, and nucleolus. 
It is assumed to be assembled in the cytoplasm and subsequently 
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transported into the necessary nuclear compartments, an event per- 
haps mediated by the putative nuclear localization signals located on 
the PM/Scl-100, Rrp41p, and PM/Scl-75 subunits (Raijmakers et al., 
2003, 2004). The events dictating how the fully or partially assembled 
exosomes are parceled to a specific nuclear compartment or retained in 
the cytoplasm are not yet known. 

The core of the human exosome is composed of nine proteins that can 
be grouped into two basic subunit types: (1) those with sequence simi- 
larity to the RNase PH domains of E. coli polynucleotide phosphorylase 
(PNPase) (Rrp41, Rrp42, Rrp46, PM/Scl-75, Mtr3, OIP2/Rrp43); and (2) 
those with S1/KH RNA binding domains (Rrp4, Rrp40, Csl4) (Allmang 
et al., 1999; Chen et al., 2001a; Mitchell et al., 1997). The complex also 
associates with several accessory factors, including the RNA helicase 
hSki2w, MPP6, which is a protein of unknown function that is phos- 
phorylated during mitosis, and PM/Scl-100, a protein with homology to 
RNase D (Chen et al., 2001a; Matsumoto-Taniura et al., 1996). In hu- 
mans, the exosome was originally identified as a group of proteins that 
coprecipitate with PM/Scl-75 and PM/Scl-100, antigens to which pa- 
tients with the autoimmune disease polymyositis-scleroderma overlap 
syndrome (PM/Scl) develop autoantibodies (Alderuccio et al., 1991; 
Ge et al., 1992; Gelpi et al., 1990; Reimer, 1990). These factors were 
later discovered to be the human counterparts to the yeast exosome 
components (Allmang et al., 1999; Brouwer et al., 2001). 

Prokaryotic PNPase, which forms the core of the degradosome (the 
prokaryotic counterpart to the exosome), is composed of amino- and 
carboxyl-terminal RNase PH domains and forms a “trimer of dimers” 
core ring structure with a central channel believed to accommodate the 
RNA substrate (Symmons et al., 2002). The fact that the eukaryotic 
exosome contains six RNase PH domain-containing subunits has lead to 
the hypothesis that it forms a similar hexameric ring core structure 
(Fig. 2). Several lines of evidence support this notion; extensive yeast 
and mammalian 2-hybrid analyses of interactions between all known 
human exosomal proteins showed that these six subunits associate 
with each other in an unambiguous order that is further bolstered by 
data from the crystallization of the archaeal exosome core (Lehner and 
Sanderson, 2004; Lorentzen et al., 2005; Raijmakers et al., 2002). These 
components all exhibit sequence homology to known 3’—5’ exonucleases, 
and several (e.g., hRrp46, PM/Scl-75, OIP2) have had nuclease activity 
experimentally demonstrated. Of this group of core components, human 
Rrp41 and Rrp44 can complement the corresponding yeast mutants 
suggesting functional conservation (Brouwer et al., 2001; Estevez 
et al., 2001; Mitchell et al., 1997; Shiomi et al., 1998). 
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Fic 2. The mammalian exosome. The core of the exosome is composed of six ribonu- 
clease subunits (Rrp41, Rrp42, Mtr3, OIP2, Rrp46, and PM/Scl-75) believed to associate 
in a ring-like structure with a central core to accomodate the mRNA, similar to the 
prokaryotic PNPase. It is also bound by three proteins with S1/KH RNA binding domains 
(Rrp4, Rrp40, and Csl4) that are likely involved in recognition of the mRNA substrate, as 
well as several accessory factors (e.g., Ski2, PM/Scl-100, Mpp6). 


As mentioned earlier, the mammalian exosome is also composed of 
three core proteins (Rrp4, Rrp40, and Csl4) with homology to the 
ribosomal S1/KH RNA binding domain. These factors have each been 
shown to interact with specific RNase PH domain subunits and are 
proposed to be located on the outer surface of the core hexameric ring 
(Lehner and Sanderson, 2004; Raijmakers et al., 2002). Thus, they may 
have a role in RNA substrate recognition whereas the other six main 
subunits would encompass the enzymatic core. It is notable that, at 
least in yeast, inactivation of individual exosomal subunits of both 
types (with RNase PH domains or with S1/KH RNA binding domains) 
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inhibits both the nuclear and cytoplasmic functions of the exosome. 
This suggests that all the subunits are required to form one active 
enzymatic complex in cells, rather than the enzymes functioning indi- 
vidually on the substrate RNAs (Allmang et al., 1999; Anderson and 
Parker, 1998; Mitchell et al., 1997). Thus, although individual recom- 
binant yeast exosome components are quite active as nucleases 
in vitro, the RNase function of the exosome complex purified from 
yeast is diminished (Mitchell et al., 1997). This suggests that the 
accessory factors present in cells but not associated with the purified 
exosome are required for in vivo activity of the multienzyme complex 
(Mitchell and Tollervey, 2000). 


D. Decapping and 5'—3' Exoribonucleolytic Decay 


Wang and Kiledjian (2001) identified a novel scavenger decapping 
activity (DepS) in HeLa cell extracts that followed 3’—5’ exosomal decay. 
DepS, which represents a major decapping pathway in mammals, is a 
pyrophosphatase that releases m’GMP upon cap cleavage. Purified 
DepS was shown to be a 40 kDa protein with a histidine triad (HIT) 
pyrophosphatase motif found in nucleotide binding proteins that hydro- 
lize pyrophosphate bonds (Liu e¢ al., 2002). Mutations within this motif 
inactivate DepS, indicating its importance in DepS catalytic function. 
DepS shows a strong preference for hydrolyzing free cap analog or caps 
attached to RNAs less than 10 nt in length (e.g., products created by 3’— 
5’ exosomal decay); in fact, DepS has been shown to associate with the 
exosome in co-immunoprecipitation experiments (Wang and Kiledjian, 
2001). The amino terminus of the protein has been demonstrated to be 
involved in cap binding, and analogous to its catalytic activity DepS 
associates with free cap much more efficiently than longer capped 
mRNAs (Liu et al., 2004b). Crystallization of DepS, revealed it to be an 
asymmetrical dimer that shifts between an open (inactive) or closed 
(active) structure upon a 30 A conformational change (Chen et al., 2005; 
Gu et al., 2004). Based on its structure, it has been proposed that the 
reason DepS binds longer capped mRNAs so inefficiently is related to 
the increased entropy present on larger RNA molecules that could block 
cap binding or prevent the enzyme from adopting the active closed 
structure (Gu et al., 2004). DepS is able to effectively compete with 
eIF4E for binding of free cap but not capped mRNA, suggesting a 
cellular mechanism to prevent eIF4E sequestration by degraded 
mRNAs (Liu et al., 2004b). Although hydrolysis of normal mRNA cap 
structure following exosomal decay occurs in the cytoplasm, a signifi- 
cant fraction of DcpS resides in the nucleus, suggesting additional roles 
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for this enzyme perhaps in the decapping of aberrant transcripts de- 
graded prior to nuclear export (Liu et al., 2004b). 

As mentioned earlier, after deadenylation the major decay pathway 
in yeast is decapping followed by 5’—3’ turnover mediated by the Xrn1 
ribonuclease. A mammalian Xrn1 homolog has been cloned and al- 
though its activity has been detected in cell extracts this appears to be 
a less important pathway in mammalian cells (Bashkirov et al., 1997; 
Gao et al., 2001; van Dijk et al., 2003; Wang and Kiledjian, 2001). The 
presence of Xrn1 in mammalian P bodies may suggest, however, that 
this enzyme is involved in mRNA decay at these sites (Kedersha et al., 
2005). In HeLa cell extracts, the deadenylation-dependent decapping 
activity mimicked that in yeast, in that it was regulated by both cap 
binding proteins and repressed by PABP, and yielded m’GDP product 
(distinct from the m’GMP product of DepS) (Gao et al., 2001). 

Mammalian homologs to the yeast deadenylation-dependent decap- 
ping enzymes Dcp1 and Dcp2 (hDecpla, hDcplb, and hDcp2) were 
cloned in 2002 and, analogous to their yeast counterparts, hDcpla 
and hDcp2 were shown to interact to form a functional enzyme loca- 
lized to the cytoplasm of cells (Lykke-Andersen, 2002; Wang et al., 
2002). hDcp1b appears to be expressed at much lower levels and its 
role in this decapping complex is not yet clear. hDcp2, the catalytic 
subunit, possesses a nucleotide diphosphate linked to X moiety (NU- 
DIX) pyrophosphatase motif that mutational analysis has shown to be 
essential for its activity. In contrast to DepS, hDcp2 cannot hydrolize 
free cap but instead acts on capped deadenylated mRNAs targeted for 
degradation by the 5/—3’ decay system (Piccirillo et al., 2003). hDcep2 
activity is inhibited by PABP, which has been shown to bind directly to 
the cap of mRNAs (but not free cap) after eIF4E removal (Khanna and 
Kiledjian, 2004). hDep1a (also called SMIF) is believed to play more of 
a regulatory role, although it has also been shown to bind Smad4 and 
transloctate to the nucleus upon TGF stimulation and is thus possibly 
involved in modulating multiple cellular pathways (Bai et al., 2002). 
The events dictating whether an mRNA is degraded by the exosome 
followed by DcepS-mediated decapping or instead first decapped by 
hDcp1la/Dcp2 then turned over by hXrn1 have yet to be revealed. 


E. Endoribonucleolytic Decay 


In E. coli, the major mRNA turnover pathways initiate with endori- 
bonucleolytic cleavage that involves the recognition of sequence 
or structural elements within the target mRNA (Kushner, 2004). 
Endonuclease-mediated decay also plays an important role in the turn- 
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over of many eukaryotic messages, particularly those whose expression 
is regulated by extracellular and/or developmental stimuli. For exam- 
ple, this type of turnover is involved in the stability of apolipoprotein I 
(Binder et al., 1989), groa (Stoeckle, 1992), maternal homeodomain 
proteins (Brown and Harland, 1990; Brown et al., 1993), albumin (Pas- 
tori et al., 1991b), transferrin receptor (Tf R) (Binder et al., 1994), IGF II 
(Meinsmaet al., 1991; Scheper et al., 1995), c-myc (Ioannidis et al., 1996; 
Lee et al., 1998), and the globin genes (Albrecht et al., 1984; Lim and 
Maquat, 1992; Wang and Kiledjian, 2000a). Specific sequence or struc- 
tural elements on the target messages likely are very important in 
recruiting endonucleases to mediate turnover (Fig. 3). In several cases, 
these elements are bound by protective factors under certain conditions 
although alternative stimuli cause them to disassociate and allow en- 
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donucleolytic cleaveage. Notable examples of such regulation are the 
TfR and a-globin messages. The 3’ UTR of Tf R contains a five stem-loop 
structure that is protectively bound by the iron regulatory protein 
under conditions of low iron. However, upon increased iron concentra- 
tions, this binding is destabilized, resulting in endonucleolytic cleavage 
of the message within the stem-loop structures (Binder et al., 1994; 
Casey et al., 1989; Seiser et al., 1995). Rather than stem-loop elements, 
a-globin messages contain three cytosine-rich elements in their 3’ UTR 
that associate with an mRNP complex including poly C binding pro- 
teins, the AU-rich element associated protein AUF-1, and PABP, which 
stabilizes the message from endonucleolytic cleavage within the C-rich 
element by the erythroid cell-enriched endonuclease (ErEN) (Waggoner 
and Liebhaber, 2003). In addition, AU-rich instability elements (de- 
scribed later) have been shown to be sites of endonucleolytic cleavage 
on certain messages (Hua et al., 1993; Kowalski and Denhardt, 1989; 
Wennborg et al., 1995). Deadenylation is not a prerequisite 
for endonucleolytic activity on the majority of targets analyzed thus 
far (Binder et al., 1994; Cunningham et al., 2001). One interesting 
exception is the ErEN nuclease involved in a-globin decay, as it has 
been shown to be inhibited by PABP and the poly(A) tail (Wang and 
Kiledjian, 2000b). 

In general, endonucleolytic decay has been difficult to characterize 
due to the extremely rapid rate of degradation of the cleavage pro- 
ducts; thus, only a few vertebrate endonucleases have been identified. 
One of the best studied is polysomal ribonuclease I (PMR1), whose 
activation upon estrogen stimulation of Xenopus hepatocytes has been 
shown to result in degradation of the majority of serum protein 
mRNAs (Pastori et al., 1991a,b). PRM1 activity has also been demon- 
strated in COS-1 and murine erythroleukemia cells (Bremer et al., 
2003; Yang et al., 2004). Unlike many of the eukaryotic decapping 
and exonucleolytic decay enzymes, PRM1 is not located in P bodies or 
in nuclei but rather is distributed diffusely throughout the cytoplasm 
(Yang and Schoenberg, 2004). It has no RNA binding domains and 
therefore does not appear to bind its mRNA substrates directly; it is, 
however, associated with mRNAs on the translating mRNP complex 
(Cunningham et al., 2001; Yang and Schoenberg, 2004). Thus, PRM1 is 
likely to be recruited to its substrates by other mRNA binding proteins, 
whereupon it associates with the mRNP complex until it is activated 
by external stimuli (e.g., estrogen) to cleave the mRNA. It is of note 
that a single endonucleolytic cleavage event would effectively disrupt 
the protective circular mRNA structure thereby providing immediate 
access to the 5’ and 3’ ends of the message by exonucleases; thus, this 
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type of cleavage should result in more rapid mRNA inactivation and 
degradation than deadenylation-mediated decay. 

Of course, the most famous endonucleolytic events are those induced 
by the RNAi pathway to cleave double stranded RNA molecules. 
As this topic has been the subject of numerous reviews, it will only 
be briefly summarized herein. The pathway is initiated by the RNase 
III-like enzyme Dicer-induced degradation of double stranded RNAs 
into siRNAs duplexes characterized by an ~20-nt duplex region and 
2 nucleotide 3’ overhang region. These siRNAs are then incorporated 
into the RNA-induced silencing complex (RISC) that directs endonu- 
cleolytic cleavage of complementary mRNAs at the specific domain 
homologous to the RNA oligo (Elbashir et al., 2001; Martinez et al., 
2002; Nykanen et al., 2001). In mammals, the endonuclease function of 
RISC is carried out by Argonaute 2 (AGO-2) (Hammond et al., 2001; 
Liu et al., 2004a; Rand et al., 2004; Tabara et al., 1999). AGO-2 contains 
a central PAZ domain responsible for binding the 3’ end of substrate 
RNAs as well as recognition of the siRNA oligo 3’ overhangs and a 
carboxyl-terminal Piwi domain believed to fold into an RNase H-like 
catalytic core (Lingel et al., 2008, 2004; Liu et al., 2004a; Ma et al., 
2004; Song et al., 2003, 2004; Yan et al., 2003). After an RNA has been 
cleaved by RISC, evidence in Drosophila cells indicates that the result- 
ing molecules are degraded by the 5’—3’ Xrn1 and 3’—5’ exosomal path- 
ways described for normal mRNA turnover (Orban and Izaurralde, 
2005). In agreement with this idea, AGO-2 has been shown to coloca- 
lize with the mRNA degradation machinery in cytoplasmic P bodies 
(Sen and Blau, 2005). 


F. AU-Rich Instability Elements 


Controlling the rate of mRNA degradation is an important mecha- 
nism to regulate expression of proteins in response to a variety of 
cellular stimuli. Thus, many mRNAs contain specific cis-acting se- 
quences that associate with trans-acting factors involved in regulating 
their stability. The most widespread and best characterized of these 
are instability elements (Bonnieu et al., 1990; Caput et al., 1986; Peng 
et al., 1996; Shyu et al., 1989) although stabilizing sequences also exist 
(Stefanovic et al., 1997). AU-rich elements (ARE), found in the 3’ UTR 
of a number of tightly regulated cellular genes, are the most common 
instability determinants in mammalian cells. Three classes of AREs 
have been described (Chen and Shyu, 1995; Peng et al., 1996; Xu et al., 
1997). Class I AREs contain one to three copies of the AUUUA penta- 
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nucleotide with a nearby U-rich sequence; these AREs are most often 
found in early response genes such as transcription factors (e.g., c-fos) 
and some cytokines (e.g., interleukin 4 and 6 [IL-4, IL-6]). Class II 
AREs contain multiple copies (five to six) of the AUUUA sequence that 
cluster together as well as a 20-30 nucleotide AU-rich region 5’ to 
these clusters; these are frequently found in cytokines (e.g., GM-CSF, 
TNFa, IL-3). Finally, class II AREs do not possess the pentanucleotide 
sequence but rather contain stretches of U residues and a U-rich 
domain (e.g., c-jun). The decay of these different types of AREs is likely 
to occur via distinct mechanisms, as it has been observed that dead- 
enylation of class I and III messages occurs with distributive kinetics 
whereas class II ARE-containing messages are deadenylated in a 
processive manner (Chen and Shyu, 1995; Xu et al., 1997). 

A variety of ARE-binding proteins (AUBPs) have been described that 
either promote ARE message turnover (e.g., TTP, KSRP, AUF1) or 
stabilize such mRNAs (e.g., HuR) (described later). AUBP-mediated 
mRNA turnover is believed to occur by a number of different mechan- 
isms, including recruitment of the exosome (Chen et al., 2001a), stim- 
ulation of decapping (Gao et al., 2001), and promoting deadenylation 
either via direct activation of PARN or perhaps by disrupting interac- 
tions between PABP/poly(A) tail and/or the cap-binding complex/5’ end 
(Gao et al., 2000; Lai et al., 2003; Wilusz et al., 2001). The observations 
that immunodepletion of the exosome resulted in stabilization of ARE 
mRNAs and that several AUBPs were associated with the exosome in 
UV cross-linking and co-immunoprecipitation experiments emphasize 
the importance of the exosome in ARE-mRNA turnover (Chen et al., 
2001a; Mukherjee et al., 2002). Collectively, these data suggest that 
multiple distinct degradative pathways may be activated independent- 
ly during ARE-mediated decay. Since AUBPs appear to be critical 
mediators of this process, we now review the known contributions of 
several of the best characterized of these proteins. 


1. Tristetraprolin 


Tristetraprolin (TTP), the most extensively studied AUBP, is one of 
three members (along with butyrate-responsive factors 1 and 2 [BRF1, 
BRF2]) of the Tis11 family of CCCH tandem zinc-finger proteins, 
which have been shown to associate with ARE-containing mRNAs 
(preferentially class II) and mediate their destabilization (Carballo 
et al., 1998). It is a nuclear/cytoplasmic shuttling protein whose export 
to the cytosol is induced by stimuli, such as serum, growth factors, or 
phorbol esters, and results in its association with AREs (Phillips et al., 
2002; Taylor et al., 1996b). TTP knockout mice exhibit a severe inflam- 


MRNA TURNOVER IN HERPESVIRAL INFECTION 353 


matory phenotype that includes polyarticular erosive arthritis, mye- 
loid hyperplasia, and autoimmunity; these defects were shown to be 
due to increased levels of TNFa and GM-CSF, whose mRNAs contain 
AREs targeted by TTP (Carballo et al., 1997). Both BRF1 and BRF2 
are essential for mouse development (Ramos et al., 2004; Stumpo et al., 
2004; Taylor et al., 1996a) and, like TTP, are involved in destabilization 
of ARE-containing mRNAs (Lai et al., 2000; Stoecklin et al., 2002). 

Lai et al. (2003) used a cell free system to show that TTP activates 
PARN to degrade class II ARE messages. The mechanism of this 
activation is still unclear, as the two proteins do not appear to interact 
(at least directly). Thus, TTP may recruit PARN indirectly via other 
associations or, alternatively, may displace protective factors from the 
mRNA 3’ end that normally prevent PARN access. In addition to 
stimulating deadenylation, Lykke-Anderson and Wagner demon- 
strated co-immunoprecipitation of TTP and BRF1 with multiple 
mRNA turnover enzymes, including hDcpla, hDcp2, hXrn1, hCCR4, 
hRrp4, and Pm/Scl-75 in a non-RNA-dependent manner (Lykke- 
Andersen and Wagner, 2005). Although TTP appears to contain both 
amino- and carboxyl-terminal activation domains, the amino-terminal 
domain alone is necessary and sufficient for these interactions. They 
further showed that overexpression of hDcp2 in HeLa cells stably 
expressing a (-globin-GM-CSF ARE chimeric reporter mRNA signifi- 
cantly increased the rate of turnover of this message but not that of 
stable non-ARE mRNAs, suggesting that decapping may be a limiting 
step of ARE decay. Although the significance of other interactions has 
yet to be revealed, these data indicate that the TTP family may be 
involved in recruiting and perhaps activating numerous pathways of 
mRNA turnover to facilitate ARE degradation. 

Control of TTP activity is a subject of some controversy. TTP is 
phosphorylated by several kinases in a manner that is believed 
to regulate its activity (Carballo et al., 2001; Mahtani et al., 2001; 
Taylor et al., 1995; Zhu et al., 2001). Stoecklin et al. (2004) reported 
that p38 MAPK/MK2-mediated phosphorylation of TTP induced bind- 
ing to 14-3-3 proteins that resulted in inhibition of its ability to de- 
grade ARE mRNAs and excluded it from stress granules (cytoplasmic 
foci containing complexes of translationally stalled mRNAs that accu- 
mulate in response to environmental stress). Furthermore, it has been 
shown that activation of the p38 MAPK/MK2 pathway together with 
overexpression of the ARE stabilizing protein HuR inhibited TTP 
function, and that PKB-mediated phosphorylation of the TTP-related 
protein BRF1 resulted in its association with 14-3-3 proteins and 
inactivation (Ming et al., 2001; Schmidlin et al., 2004). However, Rigby 
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et al. (2005) failed to detect an association of 14-3-3 with TTP and 
found that MK2 activation did not affect TTP function. Additional 
studies of TTP-mediated destabilization of the 64GalT1 gene in resting 
HUVECs also found no role for MK2 in the association of 14-3-3 with 
TTP but instead described an interaction between the 14-3-33 isoform 
and TTP that resulted in retention of TTP in the cytoplasm and facili- 
tation of TTP-induced turnover (Gringhuis et al., 2005). One possibility 
is that TTP phosphorylation by different signaling pathways may lead 
to distinct regulatory outcomes. Additionally, the seemingly contradic- 
tory data regarding 14-3-3 binding may be at least partially reconciled 
by the possibility that the different isoforms of 14-3-3 may have diver- 
gent effects on TTP function. In any case, it seems clear that more 
research is required to deconstruct the complexity of the regulation of 
this important AUBP. 


2. AUF1 


AUF 1 (also called hnRNP D) exists in four isoforms—p37, p40, p42, 
and p45—that are generated by alternative pre-mRNA splicing and 
have distinct in vitro ARE binding affinities (p37 > p42 > p45 > p40) 
(Wagner et al., 1998). Although the majority of data describe roles for 
AUF 1 in ARE-containing mRNA turnover, the observations that it is 
also part of the a-globin mRNA stability complex (Kiledjian et al., 
1997) and is involved in the major coding determinant-directed decay 
of c-fos (Grosset et al., 2000) indicate that this factor likely has multi- 
ple roles in mRNA stability. The bulk of data regarding the effect of 
AUF1 isoforms on ARE-mediated decay comes from analyzing their 
overexpression in various cell lines, which have not always produced 
congruous results. Loflin et al. (1999) demonstrated that the hemin- 
induced stabilization of 3-globin-GM-CSF ARE and (3-globin-c-fos ARE 
reporter constructs in the K562 human erythroleukemic cell line was 
reversed by overexpression of the p37 and p42 (and to a lesser extent 
p40 and p45) isoforms, indicating a role for these factors in mRNA 
destabilization. It was subsequently reported that all four isoforms 
promoted stabilization of class II ARE mRNAs in NIH 8T3 cells, with 
p37 exhibiting the strongest effect (Chen et al., 2004; Xu et al., 2001). 
However, these findings were not supported in experiments performed 
by Sarkar et al. (2003), who instead observed that overexpression of 
p37 and, to some extent, p40, promoted destabilization of a G-Gal-GM- 
CSF reporter in 293, NIH 3T3, HeLa, and COS cells. It has been 
suggested that control of ARE messages may be regulated, at least in 
part, by the relative abundance of each AUF 1 isoform (Raineri et al., 
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2004); thus, one possibility is that this may be a confounding factor in 
interpretations of overexpression data. 

AUF1 has been shown to complex with several cellular factors, 
including hsc70-hsp70, eIF4G, and PABP, and to be regulated by 
ubiquitin-mediated proteasomal decay (Laroia et al., 1999). Degrada- 
tion of GM-CSF ARE messages correlated both with release of eIF4G 
and proteosomal decay of AUF1; proteasome inhibition resulted in the 
relocalization of AUF1 from the nucleus and cytoplasm to the peri- 
nuclear region and nucleus and stabilization of GM-CSF. The ubiqui- 
tination appeared to be specific for the p37 and p40 isoforms of the 
protein (Laroia and Schneider, 2002; Laroia et al., 2002). AUF1 may 
also be regulated by phosphorylation, as dephosphorylation of p40 on 
ser83 and ser87 by TPA treatment of THP-1 leukemia cells resulted in 
the stabilization of IL-18 and TNFa mRNAs (Wilson et al., 2003). 
Finally, siRNA-mediated knockdown of p40/p45 (but not all four iso- 
forms) stabilized both a GFP-ARE reporter and endogenous GM-CSF 
RNAs in HT1080-derived cells (Raineri et al., 2004). Collectively, these 
data indicate that AUF1 is involved in the turnover of a variety of 
ARE-containing messages, although the regulation of the activity 
of each isoform and how they cooperatively contribute to this function 
is likely to be regulated on multiple levels—the majority of which have 
yet to be clarified. 


3. TIA-1/TIAR 


TIA-1 and the TIA-related protein TIAR are members of the RNA 
recognition motif (RRM) family of RNA binding proteins and each have 
three amino-terminal RRM domains that mediate their interaction 
with U-rich sequences (Dember et al., 1996). They are both shuttling 
proteins involved in the regulation of stress-induced translational 
arrest through the recruitment of mRNAs into stress granules to 
prevent translation initiation (Kedersha et al., 1999). TIA-1 associates 
with the ARE of TNFa and COX-2 mRNAs, and expression of both 
factors is increased in TIA-1 null cells (Dixon et al., 2003; Piecyk et al., 
2000). However, unlike most other AUBPs, it appears as though TIA-1- 
mediated repression of these proteins occurs at the level of translation 
rather than mRNA stability. 


4. KSRP 


K homology splicing regulatory protein (KSRP; also called FBP2) 
was identified as an AUBP from Jurkat cell extracts that associated 
with the IL-2 3’ UTR as well as with the exosome (Chen et al., 2001a). 
Gherzi et al. (2004) solidified the role of KSRP in ARE-mediated decay 
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by showing that either immunodepleting it from Jurkat, HeLa, or 
HT1080 cell extracts or abrogating its expression from HeLa cells by 
siRNA methodology resulted in stabilization of CAT reporter RNAs 
fused to AREs of all three classes. They further showed that it asso- 
ciated simultaneously with the exosome and PARN, interactions that 
required the central of its four KH domain RNA binding motifs. Thus, 
it appears as though KSRP may mediate ARE mRNA turnover by 
recruiting specific RNA turnover enzymes to the RNA. Although this 
ARE-mediated turnover occurs in the cytoplasm of cells, KSRP also 
has established roles in splicing (Min et al., 1997) and in accordance 
with the latter function appears to localize strongly to the nucleus of 
HeLa cells (Hall et al., 2004). Thus, as yet unidentified stimuli may be 
required to recruit this protein into the cytoplasm to fulfill its role in 
that locale. 


5. HuR 


HuR is a ubiquitously expressed member of the ELAV family of RNA 
binding proteins (Ma et al., 1996) and, unlike the vast majority of 
other AUBPs described, mediates stabilization rather than turnover 
of ARE-containing messages. It contains three RRM domains, the first 
two are involved in ARE binding whereas the third mediates associa- 
tion with the poly(A) tail (Chung et al., 1996; Ma et al., 1997). In 
addition, HuR possesses a novel HNS shuttling sequence that med- 
iates its transport from the nucleus, where it is predominantly located, 
to the cytoplasm (Fan and Steitz, 1998a). Translocation of HuR to the 
cytoplasm upon treatment of cells with various stimuli correlates with 
ARE mRNA stabilization (Wang et al., 2000b; Yaman et al., 2002). 
In overexpression experiments, HuR has been shown to bind and 
stabilize a number of p38 MAPK-regulated messages, including GM- 
CSF, TNFa, IL-3, and COX-2, suggesting a role for this pathway in 
HuR function (Dean et al., 2001; Fan and Steitz, 1998b; Ming et al., 
2001; Sully et al., 2004). However, the observation that it was not 
responsible for IL-8 stabilization during p38/MK2 activation and IL-8 
sequences required for HuR-mediated stabilization were distinct from 
those involved in p38/MK2 responsiveness indicate that the mechan- 
isms of p38/MK2 and HuR stabilization do not completely overlap 
(Winzen et al., 2004). Antisense or siRNA-mediated HuR depletion 
experiments have confirmed its involvement in the protection of 
several ARE-containing mRNAs, including VEGF, cyclin A, cyclin B, 
p21, urokinase, and urokinase receptor (Levy et al., 1998; Tran et al., 
2003; Wang et al., 2000a,b). 
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III. INTRODUCTION TO HERPESVIRUS-INDUCED Host SHUTOFF 


A number of viruses have been shown to potently inhibit cellular 
gene expression to facilitate their own replication. Such host shutoff 
can occur via a variety of mechanisms; transcription, RNA splicing and 
nuclear export, mRNA decay, and translation are all cellular events 
targeted for disruption during viral infection. Viruses frequently in- 
duce host shutoff to avoid competition from cellular transcripts during 
the mass-production of their own proteins and, additionally, to prevent 
expression of cellular factors involved in stimulating an immune 
response to infection. 

The herpesvirus family encompasses a large group of enveloped, 
double-stranded DNA viruses that infect a wide array of metazoans. 
All herpesviruses are characterized by their ability to engage in two 
alternative genetic programs, latency and lytic replication. In latency, 
the viral genome is maintained in the nucleus at low-copy number, and 
expression of the genome is highly restricted—of the 100-200 viral 
genes, only a handful are typically expressed in latency. As a result, no 
virus production ensues; latency represents, in effect, a persistent 
cryptic state. However, because the entire genome is retained, the 
potential for reactivation of lytic infection exists. During lytic infec- 
tion, the majority of viral genes are expressed in a temporally regu- 
lated cascade; viral DNA replicates to high-copy number and infectious 
progeny are produced with death of the host cell. 

Taxonomically, herpesviruses are subdivided into a, 3, and y sub- 
families, which differ in their genomic organization, lytic cycle kinet- 
ics, and the cell types in which latency is established. Although lytic 
replication of some (3-herpesviruses does not block cellular gene ex- 
pression, members of both the a- and y-herpesvirus subfamilies exhibit 
a prominent host shutoff phenotype during virus production. In both 
cases, the viruses encode proteins that promote the selective degrada- 
tion of mRNA within the cell. In the next sections we review current 
knowledge of herpesvirus-induced mRNA turnover, with emphasis on 
its complex relationship to cellular mRNA decay pathways. 


IV. a-HERPESVIRUS-INDUCED MRNA DECAY 


A. Overview 


Herpes simplex type 1 (HSV-1) is the prototypic member of 
the neurotropic a-herpesvirus family, and both HSV-1 and HSV-2 are 
ubiquitous human pathogens associated with a variety of diseases 
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(Roizman and Knipe, 2001). a-Herpesvirus infection initiates at muco- 
sal surfaces, where the viruses undergo lytic replication in the sur- 
rounding epithelial cells, then subsequently enter the nearby sensory 
nerve termini. There, they exploit retrograde axoplasmic flow to travel 
to the nuclei of sensory ganglia, where they establish a life-long latent 
infection. Subsequently, the latent viruses can experience periodic lytic 
reactivation; reactivated virus travels back down axon termini to cuta- 
neous epithelial cells, where further lytic spread results in lesion 
formation and transmission to new hosts. Lytic a-herpesvirus infection 
induces a global shutoff of host cell gene expression and directs pro- 
duction of three temporal classes of viral mRNAs. The a or immediate 
early class of genes encodes regulatory proteins required for the ex- 
pression of the second £8 (early) class of genes, which play key roles in 
viral replication. The final group of viral genes expressed is the y (late) 
messages encoding structural proteins for assembly of the virion. In 
addition to the newly transcribed viral genes, a-herpesviruses package 
a number of important regulatory factors into the virion tegument 
that are then delivered directly into the newly infected cells for imme- 
diate activity. One such factor is vhs, a protein responsible for degra- 
dation of cellular mRNAs resulting in widespread host shutoff during 
the early stages of infection. As discussed later, vhs activity plays 
integral roles both in facilitating the transition between stages of viral 
gene expression as well as in evasion of the host immune response. 
The ability of HSV to promote mRNA turnover in infected cells was 
first observed in the mid 1970s, although it was known that HSV 
infection caused a decrease in cellular protein synthesis a decade 
earlier (Fenwick, 1984; Sydiskis and Roizman, 1967). Early observa- 
tions that a protein synthesis decline occurred even upon infection 
with UV-inactivated virus (Nishioka and Silverstein, 1978), as well 
as in the presence of cyclohexamide or actinomycin D and in enu- 
cleated cytoplasts (Fenwick and Walker, 1978; Fenwick et al., 1979), 
suggested that a virion component could promote the effect in the 
cytoplasm of infected cells in the absence of viral protein synthesis. 
This shutoff of cellular protein synthesis was originally linked to 
mRNA abundance using polyoma virus-transformed BHK cells in- 
fected with HSV. Monitoring of the levels of polyoma virus mRNA in 
the cells before and during HSV infection revealed that within 5 h of 
HSV infection these levels had declined to 20% of those in uninfected 
cells (Pizer and Beard, 1976). Similar experiments using adenovirus- 
transformed cells showed a significant decline in adenoviral mRNA 
early upon HSV-1 infection (Spector and Pizer, 1978). A preferential 
decline in messenger RNA as opposed to other types of RNA in 
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HSV-infected cells was demonstrated by a reduced ability of cDNA 
probes to anneal to poly(A) RNA but not nonadenylated RNA (Nakai 
et al., 1982). Furthermore, it was shown that pseudorabies virus (PRV) 
infection resulted in a decline in “functional” mRNA that could be 
in vitro transcribed and translated into protein (McGrath and Stevely, 
1980). The ability of HSV to promote a rapid turnover of mRNA was 
shown in a number of different cell types (Mayman and Nishioka, 
1985; Nakai et al., 1982; Nishioka and Silverstein, 1977; Schek and 
Bachenheimer, 1985). These data described the effect of a-herpesvirus 
infection at early times after viral entry (primary shutoff); however, a 
subsequent and distinct secondary shutoff occurred at later times 
during infection that required viral gene expression (Fenwick and 
Clark, 1982; Nishioka and Silverstein, 1978; Read and Frenkel, 
1983). This secondary shutoff phenotype is caused by the product of 
the a27 ORF (ICP27), which blocks pre-mRNA splicing in infected cells 
and also affects nuclear—-cytoplasmic mRNA transport and other 
posttranscriptional processes (Hardwicke and Sandri-Goldin, 1994; 
Sandri-Goldin, 1998). 

Key to identifying the virion-associated factor required for the pri- 
mary shutoff was the isolation of 6 HSV-1 mutants capable of infecting 
cells but unable to block cellular gene expression in the presence of 
actinomycin D (Read and Frenkel, 1983). These mutants were accord- 
ingly termed vhs-1— 6, for their lack of virion host shutoff activity, and 
their ability to replicate to near wild-type levels in tissue culture 
demonstrated that the shutoff function was not essential for virus 
growth in vitro. However, analysis of viral polypeptide synthesis dur- 
ing the vhs mutant infections revealed abnormally high levels of the 
aGimmediate early) proteins, providing the first indication that this 
primary shutoff function played an additional role in regulating cer- 
tain aspects of viral gene expression. Significantly, infection with the 
vhs mutants still induced the secondary shutoff in the absence of 
actinomycin D, providing compelling evidence that the primary and 
secondary shutoff functions are carried out by distinct viral polypep- 
tides (Read and Frenkel, 1983). Subsequent analysis of the vhs-1 
mutant virus showed it to be defective in promoting enhanced turnover 
of both cellular and viral mRNAs (Kwong and Frenkel, 1987; Oroskar 
and Read, 1989; Strom and Frenkel, 1987). Vhs-1 mapped to a single 
base mutation within the UL41 open reading frame (ORF) of HSV-1 
(Kwong and Frenkel, 1989; Kwong et al., 1988). 

Shortly after mapping vhs mutations to HSV-1 UL41, peptide anti- 
sera were used to show that this ORF encoded a 58 kDa phosphopro- 
tein expressed late in infection, which, as predicted from the shutoff 
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activity of UV-inactivated virus, was a component of the viral tegu- 
ment (McLauchlan et al., 1992; Smibert et al., 1992). Evidence indi- 
cates that vhs protein present in the virions is released into the 
cytoplasm of newly infected cells, where it promotes a widespread 
degradation of mRNA. Vhs homologs are found in all a-herpesviruses, 
including HSV-1, HSV-2, varicella-zoster virus (VZV), PRV, bovine 
herpesvirus 1 (BHV-1), and equine herpesvirus 1 (EHV-1) but not in 
b- or y-herpesviruses (Berthomme et al., 1993). In addition, all vhs 
homologs have been shown to possess RNase activity in cells, albeit 
with differing efficiencies; for example, the HSV-2 vhs is 40—50-fold 
more active than that of HSV-1, whereas VZV and EHV-1 vhs display 
significantly reduced activity (Everly and Read, 1997; Lin et al., 2004a; 
Sato et al., 2002). As described in the following section, over the past 
20 years significant advances have been made toward understanding 
vhs biology and the mechanisms by which it promotes degradation of 
cellular and viral mRNAs as well as facilitates the sharp transition 
between a, 3, and y viral gene expression. 


B. Vhs as a Ribonuclease 


The development of in vitro assays to analyze vhs activity greatly 
facilitated the dissection of its RNase function (Smiley et al., 2001). 
Krikorian and Read were the first to describe such a system and using 
HeLa cytoplasmic extracts they showed that GAPDH and TK mRNAs 
were rapidly degraded in extracts from wt HSV-1 but not vhs-1 
mutant-infected cells (Krikorian and Read, 1991). The mRNA degra- 
dation proceeded even after treatment with RNasin to inactivate non- 
specific cellular RNases but was inhibited by proteinase K digestion, 
exposure to high temperature, or high K* concentrations (>7 mM). 
Furthermore, this activity was strongly dependent on the presence of 
Mg”* (optimally 20 mM). Additional evidence that the HSV-associated 
RNase activity derived from vhs rather than a contaminating cellular 
component came from Zelus et al. (1996), who showed that °“P-labeled 
G3-globin mRNA was effectively degraded upon incubation with purified 
wt but not vhs-1 or ASma [a truncation mutant of vhs (Read et al., 
1993)] mutant virion extract; preclearing of the extract with anti-vhs 
antibodies blocked the mRNA turnover. The observations that in vitro 
translated vhs protein from rabbit reticulocyte lysates (RRL) similarly 
degraded exogenously added mRNA and that reporter gene expression 
was blocked in cells transiently transfected with a vhs-expressing 
plasmid demonstrated that vhs RNase activity did not require 
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additional viral components (Jones et al., 1995; Pak et al., 1995; Zelus 
et al., 1996). 

These in vitro degradation systems for vhs also allowed a number of 
questions regarding the mechanism of its RNA turnover function to be 
addressed. For example, the fact that vhs promotes widespread mRNA 
turnover but spares rRNA and tRNA suggests that mRNA molecules 
may possess a specific feature(s) that renders them susceptible to 
decay. However, the most obvious distinguishing elements on mRNAs, 
their 5’ cap and 3’ poly(A) tail, were shown to be dispensable for RNA 
turnover by RRL-translated vhs (Elgadi et al., 1999; Zelus et al., 1996). 
Precise analysis of the degradative intermediates of a labeled SRPa 
RNA revealed multiple endonucleolytic cleavage events by vhs, which 
appeared to cluster toward the 5’ end of the transcript (Elgadi et al., 
1999). Additional experiments measuring degradation intermediates 
of the TK mRNA in infected HeLa cells confirmed a more rapid decay of 
the 5’ mRNA segment over the 3’ end (Karr and Read, 1999). Vhs 
remained active in the RRL system after removal of the ribosomes by 
centrifugation, arguing against the idea that vhs is directed to mRNAs 
undergoing translation (Elgadi et al., 1999). Addition of an EMCV or 
poliovirus IRES element into reporter mRNAs in multiple positions 
resulted in a strong preference for endonucleolytic cleavage just down- 
stream of the IRES, indicating that these elements can somehow 
influence vhs activity (Elgadi and Smiley, 1999). Unexpectedly, the 
vhs-1 mutant, which fails to degrade non-IRES containing mRNAs, 
was still able to promote endonucleolytic cleavage specifically after the 
ECMV IRES element in vitro (Lu et al., 2001). After the initial IRES- 
dependent cleavage event, the vhs-1 mutant failed to subsequently 
promote degradation of the resulting RNA fragments, however, sug- 
gesting that perhaps wt vhs promotes multiple types of mRNA degra- 
dation and that the vhs-1 mutant is defective in only select aspects of 
the enzymatic function. 

Vhs displays weak homology to the cellular fen-1 nucleases (Doherty 
et al., 1996), a family of enzymes with roles in DNA replication and 
repair. In addition, a BLAST search followed by hidden Markov 
modeling revealed similarities between vhs and a larger family of 
nucleases from humans, yeast, and E. coli, including XPG, RAD2, 
and Pol I (Everly et al., 2002). Vhs proteins within the a-herpesvirus 
family display significant homology in three conserved regions of the 
protein (Berthomme et al., 1993; Everly and Read, 1997; Jones et al., 
1995), and the first two conserved vhs domains exhibit homology to 
critical residues within the catalytic sites of the aforementioned cellu- 
lar nucleases. Significantly, mutation of several vhs amino acids 
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corresponding to catalytic residues in the homologous cellular nu- 
cleases reduced or abrogated vhs nuclease activity, providing further 
genetic evidence that vhs is a nuclease (Everly et al., 2002). 

Although all of the previously mentioned experiments strongly sug- 
gest that vhs possesses RNase activity, without directly purifying the 
protein the possibility that vhs does not act alone cannot be excluded. 
For example, vhs could activate a host nuclease or could represent an 
essential subunit of a cellular RNase complex; alternatively, vhs might 
require a cellular cofactor to activate its own latent nuclease function 
or to direct it to its mRNA targets. Several lines of evidence suggest 
that in fact vhs likely requires at least one mammalian factor to 
achieve efficient cellular mRNA turnover. Although vhs retains some 
nuclease activity in yeast, degradation of reporter RNAs in this system 
is markedly less efficient than in mammalian extracts and there is no 
global turnover of mRNA in vhs-expressing yeast (Doepker et al., 2004; 
Lu et al., 2001). In addition, vhs-mediated degradation in yeast pro- 
duces a different mRNA decay pattern and cleavage after IRES ele- 
ments is not observed, suggesting that yeast lack one or more 
mammalian factors required for proper targeting of vhs to mRNAs. 
Supporting this notion is the observation that addition of RRL to the 
vhs-expressing yeast lysates both enhances vhs activity and restores 
targeting of vhs to IRES elements (Doepker et al., 2004). Furthermore, 
extracts of partially purified HSV virions exhibit a vhs-induced RNA 
degradation pattern that lacks the specificity observed in vivo and in 
RRL, in that non-mRNAs do not escape degradation, and the vhs- 
induced turnover does not occur at the 5’ RNA sites preferred in both 
the RRL and HeLa cell systems (Zelus et al., 1996). These data suggest 
that the virion extracts lack one or more components necessary to 
target vhs appropriately to its substrates. 

It is intriguing that sites of vhs-mediated endonucleolytic cleavage 
are at the 5’ end of mRNAs and directly after IRES elements, given 
that these are regions associated with mRNA translation initiation. 
Although neither translational inhibitors nor the absence of a 5’ cap 
appear to interfere with vhs-mediated turnover, it remains possible 
that vhs associates with a translation initiation factor that directs it to 
these sites on the mRNA (Elgadi and Smiley, 1999; Elgadi et al., 1999; 
Schek and Bachenheimer, 1985; Strom and Frenkel, 1987). After all, 
such an event would not require actual translation initiation or elon- 
gation but could nonetheless provide the specificity exhibited by vhs 
for mRNAs in vivo. In support of this idea, vhs was shown to interact 
with the mammalian translation initiation factor eIF4H and the RNA 
helicase eIF4AII in a yeast 2-hybrid system and in mammalian cell 
co-immunoprecipitations and also with eIF4B in vitro (Doepker et al., 
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2004; Feng et al., 2001, 2005). Several vhs mutants with defective 
RNase activity failed to bind eIF4H in GST pulldown assays, suggest- 
ing that this interaction may be important for vhs-mediated mRNA 
turnover. eIF4H functions early in translation initiation, where it is 
believed to assist in mRNA unwinding and scanning of the small 
ribosomal subunit (Hershey and Merrick, 2000). eIF4B is the sequence 
paralog of eI[F4H and assists eIF4H with the helicase activity of eIF4A 
(Richter et al., 1999; Rogers et al., 2001). The observation that eIF4AIT 
(one of the three eIF4A isoforms) associates directly with both eIF4H 
and vhs provides a link between vhs and the eIF4A/eIF4G/eIF4E cap 
binding complex (Feng et al., 2005). Vhs mutants, such as T2141, 
which cannot bind eIF4H but retain binding to eIF4AII and are defec- 
tive for turnover of mRNAs translated by scanning, indicating that 
interactions of vhs with both proteins may be required for appropriate 
mRNA targeting and cleavage. Thus, one attractive model is that these 
factors direct vhs to sites of translation initiation on mRNAs, perhaps 
thereby providing both the specificity for mRNAs and explaining the 
preferential endonucleolytic cleavage by vhs within the 5’ quadrant of 
these molecules. This notion is supported by the fact that addition of 
partially purified eIF4H or eIF4B to vhs-expressing yeast extracts en- 
hances the turnover of reporter mRNAs. However, neither of these 
proteins can restore vhs-mediated targeting to IRES elements in the 
yeast system, suggesting that additional mammalian factors (likely 
eIF4A, and perhaps others) are involved in directing vhs-mediated 
turnover and/or contributing to its enzymatic activity (Doepker et al., 
2004). The eIF4H-vhs interaction may also help resolve the issue of 
whether vhs is itself an RNase by facilitating purification of the vhs 
protein. The major impediment to definitively calling vhs a nuclease has 
been the inability to generate highly purified protein due to its insoluble 
nature. However, Everly et al. (2002) revealed that co-expression with 
eIF4H significantly enhanced the solubility of vhs in E. coli, and their 
purified vhs from this system retained nuclease activity (although 
additional protein bands were present in the vhs fractions). 

From these data it can be seen that the vhs endonucleolytic activity 
may parallel that of cellular endonucleases, such as PMR1, in that it 
appears to be similarly targeted to specific sites on target mRNAs via 
other RNA or cap binding factors and/or structural elements (Fig. 4). 
However, the fact that mammalian endonucleolytic cleavage is re- 
served for select messages whereas vhs promotes global mRNA turn- 
over would indicate that the proteins and/or structural elements which 
recruit or activate vhs are much more widespread than those involved 
in the selective mammalian endonucleolytic decay. The observed inter- 
actions between vhs and elF4 proteins fit this prediction as these 
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Fic 4. Model for vhs targeting to mRNAs. Vhs is packaged into the viral particles, and 
is thus present in the cell immediately after infection, where it promotes rapid turnover 
of mRNAs. Vhs preferentially cleaves near the 5’ end of messages, and is likely targeted 
to these sites via its interactions with the translation initiation factors eIF4H, eIF4A, 
and eIF4B. Vhs-mediated endonucleolytic cleavage then produces exposed and unpro- 
tected 5’ and 3’ termini, which would be susceptible to rapid turnover by cellular 
exonucleases and/or additional cleavages by vhs. 


factors should associate with a large pool of mRNAs. An additional 
interesting issue is whether decay of vhs-cleaved messages proceeds 
via the Xrn1 and exosomal pathways as is likely the case for mRNAs 
targeted by mammalian endonucleases, or whether vhs itself com- 
pletes the degradation. One or a few vhs-mediated cleavage events 
would effectively inactivate a message and render it susceptible to 
immediate degradation by cellular exonucleases. Although purified 
virion extract does promote turnover of labeled substrate RNAs, the 
mRNA half-lives were shortened substantially in the presence of cyto- 
sol, suggesting that the cleavage products are targets of cellular 
nucleases (Zelus et al., 1996). 


C. Degradation of ARE-Containing Cellular Messages 


DNA microarray analysis has identified a number of cellular genes 
that are upregulated during HSV infection, even in the face of host 
shutoff (Taddeo et al., 2002). Several of these genes were noted to 
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contain AREs within their 3’ UTRs, characteristic of tightly regulated 
unstable messages such as cytokines and growth factors. As discussed 
earlier in this review, inactivation of ARE-containing messages is 
assisted by several cellular ARE binding proteins, including TTP, 
TIA-1, and TIAR. Cells infected with HSV-1 exhibited an increase in 
TTP expression at both the mRNA and protein level (Esclatine et al., 
2004). Furthermore, the TTP in infected cells was localized diffusely 
within the cytoplasm whereas it was predominantly nuclear in unin- 
fected cells [except upon mitogen stimulation (Taylor et al., 1996b)]. 
Although neither TIA-1 nor TIAR were upregulated by HSV-1, both 
proteins were similarly present in the cytoplasm of infected cells, 
reflective of an activated state (Esclatine et al., 2004). It is unlikely 
that vhs is directly responsible for the TTP upregulation given 
that TTP was not induced upon infection with a Aa4 virus that con- 
tains active vhs. However, infection of HEp-2 or HFF cells with a vhs 
mutant virus (AU;41) similarly failed to induce TTP, suggesting 
that vhs is somehow involved in TTP accumulation. Supporting this 
hypothesis is the observation that vhs interacts with TTP in co- 
immunoprecipitations from infected cells. In addition, AU;,41-infected 
cells displayed an aggregate pattern of TIA-1/TIAR staining in the 
cytoplasm reminiscent of stress granules rather than the diffuse stain- 
ing observed during wild type HSV-1 infection. Thus, interactions of 
these factors may be altered in the absence of vhs leading to the 
divergent localizations. 

The outcome of the upregulation and/or cytoplasmic relocalization of 
these AUBPs with respect to the stability of HSV-induced ARE-con- 
taining cellular mRNAs is currently controversial. Taddeo et al. (2003) 
examined the expression of one such mRNA, the stress-inducible cel- 
lular gene JEX-1, throughout infection of cells with HSV-1 or AU;41. 
They reported that although early in infection JEX-1 mRNA was upre- 
gulated and translated into protein, at later times the message existed 
in partially degraded forms lacking 3’ sequences and was therefore 
incapable of producing protein product. These degradation products 
were not seen in AU;41 infections, suggesting that vhs was responsi- 
ble for their cleavage. Esclatine et al. (2004) extended these results to 
show that similar degradation products accumulated in a vhs- 
dependent manner for the ARE-containing c-fos and IkBa mRNAs 
but not for GADD45 and TTP mRNAs that lack AREs. The authors 
argue for a model whereby the induction of ARE-containing cellular 
genes that may be detrimental to viral infection or immune evasion is 
quenched by a selective and sequence-specific degradation of the 
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mRNAs by vhs, perhaps assisted by the AUBPs TTP and TIA-1/TIAR 
(Esclatine et al., 2004; Taddeo et al., 2003, 2004). 

However, this hypothesis has been contested by data from Hsu et al. 
(2005), who failed to detect any increased degradation of JEX-1 mRNA 
during infection. While they did observe the degradation intermedi- 
ates described earlier during HSV-1 infection, they also found them in 
cells infected with vhs-deficient virus as well as in uninfected cells. In 
addition, they measured the half-life of JEX-1 mRNA by actinomycin D 
chase assays and found it to be stabilized both in cells infected with 
wild-type and vhs-null virus relative to uninfected cells. Thus, rather 
than stimulating selective degradation of JEX-1, their data suggest 
that HSV-1 infection stabilizes this message even in the presence of 
vhs. They present the possibility that the vhs-TTP interaction may 
contribute to this effect by disrupting the normal turnover of select 
ARE-containing messages in infected cells (Hsu et al., 2005). 

Although more research is required to resolve this issue, in either 
case the tantalizing link between vhs and cellular ARE-mediated 
mRNA decay is anticipated to reveal novel aspects of vhs-mediated 
as well as perhaps cellular turnover of this important class of mes- 
sages. It will also be of considerable interest to determine how the 
group of non-ARE containing messages upregulated early in HSV 
infection is evading degradation by vhs. 


D. Regulation of Vhs During HSV Infection 


It is well established that the nuclease activity of vhs plays an 
integral role in facilitating the sharp transition from immediate early 
(a) to early (8) and late (y) viral gene expression, due to the fact that 
vhs delivered into the cytoplasm from infecting virions degrades both 
cellular and immediate early viral mRNAs. In the absence of other 
viral factors vhs readily degrades 8 and y mRNAs as well, although 
during infection these genes are expressed to high levels. Thus, it 
seemed likely that one or more viral factors might exert a temporal 
control of vhs activity to protect delayed-early (8) and late (y) viral 
transcripts from turnover. A candidate vhs regulatory factor was re- 
vealed when Smibert et al. (1994) showed that VP16, a viral structural 
protein involved in the transcriptional activation of the a-genes 
(Batterson and Roizman, 1983; Campbell et al., 1984), interacted with 
vhs. VP16 is a highly abundant tegument protein, which, like vhs, is 
delivered directly into cells by infecting virions. It is composed 
of a C-terminal acidic activation domain and a DNA binding domain 
that targets it to a-gene promoters via interactions with the host 
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factors Oct1 and HCF (Herr, 1998; Herr and Cleary, 1995). The signifi- 
cance of the VP16—vhs interaction became evident upon the observation 
that infection with a VP16 null mutant virus (8MA) resulted in drasti- 
cally reduced levels of viral protein synthesis, suggesting that VP16 was 
required for the maintenance of viral intermediate and late protein 
levels (Lam et al., 1996). This reduction was not due to the absence of 
VP16 transcriptional activation activity, as neither infection with a 
VP16 truncation mutant lacking the activation domain but retaining 
its vhs binding capability nor with a VP16—vhs double knockout (8MA/ 
ASma) resulted in a viral protein synthesis decline. 

The interaction domains of these factors were mapped to amino 
acids 310-330 of vhs and a nonlinear VP16 motif (making precise 
mapping difficult), although isolation of a VP16 L344 point mutant 
defective for vhs binding but retaining its other functions demon- 
strated these activities of VP16 are separable (Knez et al., 2003; 
Schmelter et al., 1996). Deletion of the 20-amino acid VP16 binding 
domain of vhs does not affect its ability to degrade reporter RNAs in 
transient transfection assays, in agreement with the fact that the 
binding domain is located outside of the region required for nuclease 
activity (Strand and Leib, 2004). However, in the context of a viral 
infection this mutant displayed several unexpected properties. First, 
in cells infected with the VP16 binding domain mutant virus (A20) vhs 
A20 retained the ability to interact with virally expressed VP16. Thus, 
the interaction of vhs with VP16 that occurs in yeast 2-hybrid assays 
and in co-immunoprecipitations from RRLs does not mimic their asso- 
ciation during a viral infection. Secondly, despite the ability of vhs A20 
to bind VP16 during infection, this mutant virus lacked vhs activity 
upon infection in the presence of actinomycin D, although it exhibited 
wild-type levels of vhs activity in the absence of actinomycin D. Thus, 
while the newly made vhs A20 was active, the vhs A20 derived from 
the tegument was not. Furthermore, this A20 virus was attenuated in 
mice, although not to the extent of a vhs null virus. Collectively, these 
data indicate that this 20 amino acid region plays a critical role in 
tegument-derived vhs function, perhaps as a site of a specific phos- 
phorylation event or other posttranslational modification required for 
release from the virus particle in an active form. 

Although the mechanism whereby VP16 modulates vhs activity is 
not yet known, the simplest model is that VP16 binds and inactivates 
newly synthesized vhs, for example, by preventing other necessary 
cellular interactions or sequestering vhs within the nucleus. There 
are many more copies of VP16 in the tegument than there are of vhs, 
yet tegument-derived vhs is not inactivated by the incoming VP16. 
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One possibility is that VP16 is complexed with other factors in the 
virion that preclude its ability to bind vhs or, alternatively, upon entry 
into cells VP16 preferentially interacts with its cellular binding part- 
ners and is titrated away from the vhs molecules. The vhs-induced host 
shutoff could then be envisioned to eliminate these cellular binding 
competitors and free up VP16 to associate with and inactivate vhs. 


E. Vhs Activity In Vivo 


Although deletion of vhs has only minimal effects on HSV replication 
in cell culture, it has a profound influence on viral pathogenicity and 
clearance in mice and inhibits establishment of latency as well as 
reactivation (Leib et al., 1999; Strelow and Leib, 1995, 1996; Strelow 
et al., 1997). This is presumably due to the ability of vhs to block host 
immune responses via downregulation of key immunomodulatory mo- 
lecules. As these data are the subject of a review (Smiley, 2004), they 
will only be briefly summarized herein. 

Initiation of an in vivo antiviral immune response requires 
the activity of dendritic cells (DCs), potent antigen presenters that 
acquire antigens from peripheral tissues then migrate to the lymph 
nodes where they mature and stimulate naive T cells (Banchereau and 
Steinman, 1998). Infection with HSV-1 has been shown to inhibit the 
maturation and T cell stimulatory capacity of DCs as well as down- 
regulate several cell surface markers, thus evading this integral arm of 
the immune system (Kruse et al., 2000; Salio et al., 1999). Significantly, 
Samady et al. (2003) subsequently demonstrated that deletion of the 
vhs gene from HSV releases the block to DC activation thereby permit- 
ting T cell stimulation, suggesting that early host shutoff plays a role 
in escaping T cell-mediated antiviral responses of the host. However, 
that HSV-1 infection of DCs induces several changes that are not 
dependent on vhs, including the downregulation of CCR7 and CXCR4 
that results in migration defects (Prechtel et al., 2005). a-Herpes- 
viruses also evade cytotoxic T lymphocyte (CTL) responses by interfer- 
ing with cell surface MHC I expression through both a reduction in the 
overall levels of MHC I synthesis and by ICP47-mediated inhibition of 
TAP peptide transport (Fruh et al., 1995; Hill et al., 1994, 1995; York 
et al., 1994). In addition, HSV-1 infection of glioblastoma cells results 
in a shutoff of MHC class II synthesis and cell surface expression 
(Trgovcich et al., 2002). The activity of the corresponding viral vhs 
proteins has been shown to contribute to the general decrease in both 
MHC class I and class II expression (although other mechanisms are 
involved in the specific surface downregulation of these factors); thus, 
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vhs likely contributes to viral evasion of T cell responses (Gopinath 
et al., 2002; Koppers-Lalic et al., 2001; Tigges et al., 1996). 

Evidence suggests that HSV-2 vhs plays a role in viral evasion of 
innate immunity as well via modulation of the a interferon (IFN) 
response. IFN release from infected cells triggers a signal transduction 
cascade resulting in expression of an array of IFN-stimulated genes 
(ISGs) that block protein synthesis, degrade RNA, and inhibit viral 
replication (Taniguchi and Takaoka, 2002). Unlike many other viruses, 
HSV infection is quite resistant to the effects of this response (Harle 
et al., 2002; Leib et al., 1999; Mossman et al., 2000). HSV-2 vhs 
mutants were severely attenuated for replication and pathogenesis in 
wild-type 129 mice but not in IFN-a receptor knockouts, suggesting 
that vhs may be a key player in HSV-2-mediated IFN resistance 
(Murphy et al., 2003). In contrast, an HSV-1 vhs mutant remained 
defective in both types of mice, in agreement with previous data 
showing that IFN resistance in HSV-1 mapped to viral factors other 
than vhs (Leib et al., 1999; Murphy et al., 2003). Additional data 
however suggest that HSV-1 vhs may instead cooperate with ICPO to 
dampen the effects of direct ISG stimulation by IRF-3 and IRF-7 in the 
absence of IFN activity, thereby contributing to HSV-1-mediated eva- 
sion of the innate immune response (Lin et al., 2004b). Furthermore, 
Suzutani et al. (2000) showed in a mouse encephalitis model that an 
HSV-1 vhs-deficient mutant inoculated into the mouse brain had se- 
vere virulence defects unless the mouse was subjected to y-ray irradi- 
ation thereby eliminating sensitive cells such as lymphocytes and 
neutrophils (Suzutani et al., 2000). They suggest that vhs therefore 
may be involved in viral evasion of these host defense mechanisms, 
and support this hypothesis with the observation that cells infected 
with the vhs null virus expressed significantly higher levels of IL-12, 
IL-8, and MIP-1a. 

Collectively, these data indicate that in addition to facilitating virus 
replication and gene expression in infected cells, vhs likely plays a 
critical immune evasion role in vivo by contributing to the suppression 
of both specific and nonspecific host immune responses. 


V. y-HERPESVIRUS—INDUCED MRNA Decay 


A. Overview 


y-Herpesviruses comprise a group of lymphotropic viruses contain- 
ing two members—Kaposi’s sarcoma-associated herpesvirus (KSHV) 
and Epstein-Barr virus (EBV)—that cause human disease. KSHV is 
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the most recently identified human tumor virus and is associated with 
several proliferative disorders, the most common being Kaposi’s sarco- 
ma (KS) (Moore and Chang, 2001). Although originally described as a 
rare tumor found predominantly in elderly Mediterranean or African 
men, with the onset of the AIDS epidemic KS became the most com- 
mon neoplasm associated with untreated human immunodeficiency 
virus (HIV) infection. KS lesions are characterized by proliferation of 
spindle-shaped endothelial cells, infiltration by host inflammatory 
cells, and striking neoangiogenesis. Although still incompletely under- 
stood, KS pathogenesis is thought to involve high-level production of 
growth factors, inflammatory cytokines, and angiogenic factors by one 
or more of the cellular components of the lesion. In addition to KS, 
KSHV is associated with two rare lymphoproliferative disorders, pri- 
mary effusion lymphoma (PEL) and multicentric Castleman’s disease 
(MCD). 

Similar to herpes simplex viruses, EBV is nearly ubiquitous in the 
human population; nearly 90% of the world is infected with this virus 
(Kieff, 1996). The majority of infected people remain asymptomatic, 
although EBV is linked to several human cancers including Burkitt’s 
lymphoma, Hodgkin’s disease, and nasopharyngeal carcinoma. It is also 
the causative agent of infectious mononucleosis, which can arise in 
persons whose primary exposure does not occur until adolescence. In 
immunocompromised hosts, such as AIDS patients and transplant re- 
cipients, the virus can induce fatal B-cell lymphoproliferative disease. 
EBVestablishes a latent and lifelong infection in B lymphocytes and, as 
with KSHV, only a small subpopulation of infected cells undergoes lytic 
replication. 

The question of whether 7-herpesviruses induce a host shutoff upon 
lytic infection had lain neglected largely because of technical difficul- 
ties. For EBV and KSHV, typical stimuli that induce lytic reactivation 
do so inefficiently, resulting in induction of only 5-20% of latently 
infected cells. Under such circumstances, continued host gene expres- 
sion in the majority latent population would obscure any shutoff in the 
lytic population. However, within the last few years it has become 
possible to bypass these experimental difficulties, and with this has 
come the realization that, like their a-herpesvirus counterparts, sev- 
eral y-herpesviruses also promote a global shutoff of cellular gene 
expression at the level of mRNA stability. Though, these viruses have 
no vhs homolog and instead utilize a completely different viral gene 
product to promote cellular mRNA turnover. Furthermore, data sug- 
gest that the mechanism(s) by which the shutoff is achieved at least 
during KSHV infection are likely to be quite unique. 
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Unlike most a-herpesviruses, the default pathway for KSHV infec- 
tion is latency, during which only a restricted subset of viral genes 
required for maintenance of the viral episome and promoting cell 
growth are expressed (Dupin et al., 1999; Friborg et al., 1999; Radkov 
et al., 2000). However, there is also a small subpopulation of infected 
cells undergoing spontaneous lytic induction; this lytic cycle can be 
more efficiently induced by ectopic expression of the viral major lytic 
transactivator RTA (Cannon et al., 2000; Gradoville et al., 2000; Lukac 
et al., 1998; Miller et al., 1996). By optimizing the efficiency of lytic 
induction, as well as by developing single-cell assays for shutoff, 
Glaunsinger and Ganem (2004b) showed that a robust shutoff of host 
protein synthesis initiated early in infection was not blocked by the 
viral DNA replication inhibitor PFA and continued until cell death 
(Glaunsinger and Ganem, 2004b). Moreover, analysis of several cellu- 
lar messages by northern blotting and nuclear runoff assays showed 
that while cellular genes were efficiently transcribed in infected cells, 
total mRNA levels were drastically reduced by 12 h post reactivation. 
Overall, these data indicated that lytic KSHV infection promotes deg- 
radation of cellular mRNAs. Similar results were subsequently ob- 
tained from experiments analyzing EBV-induced changes to host 
gene expression during lytic reactivation of EBV-positive AKBM cells 
(Ressing et al., 2005; Ressing, M. E., and Wiertz, E., in preparation). 
These cells exhibited an EBV-induced reduction in total cellular pro- 
tein synthesis that did not require late viral gene expression. More- 
over, RT PCR analyses of several cellular transcripts showed that, 
similar to KSHV, this shutoff manifested at the level of mRNA. As 
discussed in the next section, a screen of KSHV lytic genes for the 
ability to promote shutoff revealed a single viral factor, encoded by 
ORF 37 (termed SOX), to be responsible for this phenotype; its EBV 
homolog (BGLF5) was shown to possess a parallel function (Ressing, 
M. E., Glaunsinger, B., Ganem, D., and Wiertz, E., in preparation). 


B. Effectors of y-Herpesvirus—Induced Host Shutoff 


A screen of KSHV genes for their ability to promote turnover of a 
GFP reporter protein in 293 cells lead to the identification of ORF37 as 
a candidate host shutoff factor (Glaunsinger and Ganem, 2004b). In 
agreement with this function as well as its activity as a DNase (dis- 
cussed later), the protein was named SOX (shutoff and exonuclease). 
SOX expression initiates at approximately 8 h post lytic reactivation, 
precisely concordant with the onset of host shutoff. Unlike vhs, SOX 
is not packaged into the virion (Bechtel and Ganem, unpublished 
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observations) and therefore cannot promote host shutoff immediately 
upon KSHV entry. Co-expression of this gene with a GFP reporter 
message resulted in disappearance of the GFP mRNA, despite the 
fact that transcription of the gene was unaffected as were the levels 
of GFP DNA. It was subsequently shown that SOX significantly re- 
duced the half-life of the GFP message. SOX-induced mRNA turnover 
was also shown for several additional cellular messages, confirming 
that its activity paralleled the shutoff phenotype observed during 
KSHV lytic infection described earlier (Glaunsinger and Ganem, 
2004b; Glaunsinger et al., 2005). Finally, siRNA-mediated knockdown 
of SOX during KSHV replication effectively blocked turnover of the 
cellular messages, demonstrating that it is the dominant effector of 
KSHV-induced host shutoff. 

Surprisingly, the gene encoding SOX is conserved across the entire 
herpesvirus family, where it is known to function as an alkaline exo- 
nuclease (AE) on viral DNA (Goldstein and Weller, 1998; Martinez 
et al., 1996). In addition, it has been revealed that the HSV-1 AE 
homolog (UL12) exhibits strand exchange activity in vitro and acts in 
conjunction with the single stranded DNA binding protein ICP8 as a 
recombinase (Reuven and Weller, 2005; Reuven et al., 2003, 2004). 
These functions are believed to be important for resolution of branched 
structures in newly replicated viral DNA, as HSV-1 UL12 mutants 
accumulate highly structured DNA in the nucleus leading to poor 
packaging of the genome and inefficient nuclear egress of the progeny 
virions (Shao et al., 1993). Although the HSV AE protein exhibits both 
5’ and 3’ exonuclease activity and limited endonuclease activity on 
DNA, it does not possess any RNA turnover activity (Glaunsinger 
and Ganem, 2004b; Knopf and Weisshart, 1990; Sheaffer et al., 
1997). Furthermore, the AE proteins have no role in host shutoff in 
either a- or 3-herpesviruses. KSHV SOX does retain the conserved 
DNase activity of its homologs—as would be expected considering both 
the essential nature of that function and the conservation of regions 
known to be required for DNase activity. Thus, it appears as though 
evolution has endowed KSHV SOX with a novel mRNA degradation 
function. 

In accordance with its established functions, HSV-1 AE has been 
shown to exhibit a strictly nuclear cellular localization. In contrast, 
SOX localization mapped to both the nucleus and cytoplasm of cells 
(Glaunsinger et al., 2005). SOX mutants lacking the putative nuclear 
localization signal (NLS) lost the majority of their nuclear staining and 
were no longer active in DNase assays but retained wild-type host 
shutoff function. These data suggest that it is the cytoplasmic fraction 
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of SOX that is involved in cellular mRNA turnover. In agreement with 
this hypothesis, Northern blotting of nuclear and cytoplasmic RNA 
fractions in SOX-expressing cells demonstrated a selective reduction 
in the cytoplasmic messages (Glaunsinger et al., 2005). 

Extensive mutagenesis of the SOX protein led to the identification of 
multiple single-function mutants that retained either the DNase or the 
shutoff function, thereby demonstrating that these activities are ge- 
netically separable (Glaunsinger et al., 2005). The fact that shutoff- 
deficient mutations are distributed across the SOX protein indicates 
that they are likely part of one or more nonlinear motifs required for 
RNA turnover (Fig. 5A). The mutant that selectively lost DNase activ- 
ity mapped to conserved residues of the protein, whereas the mutants 
that lost RNA shutoff activity mapped to numerous nonconserved 
residues. These observations lend further credence to the hypothesis 
that KSHV SOX has evolved a novel function not present outside the 
4-herpesvirus family. 

Given the similarity of the EBV-induced host shutoff phenotype to 
that of KSHV and the conservation of several residues involved in 
SOX-induced shutoff, it seemed likely that the viral effector of the 
EBV-induced mRNA turnover would be the EBV homolog of SOX, 
BGLF5. It has been shown that upon co-expression with BGLF5 the 
levels of a GFP reporter message are drastically reduced in 2938T cells 
and mRNA half-life experiments confirmed that this reduction is due 
to BGLF5-induced mRNA instability (Ressing, M. E., Glaunsinger, B., 
Ganem, D., and Wiertz, E., in preparation). The ability of BGLF5 to 
promote a global host shutoff was demonstrated by infecting Sf9 insect 
cells with a BGLF5-expressing recombinant baculovirus; metabolic 
labeling showed a significant reduction in total protein synthesis 
in the BGLF5-expressing cells compared with uninfected cells or 
cells infected with a control EBNA1-expressing baculovirus (Ressing, 
M. E., Glaunsinger, B., Ganem, D., and Wiertz, E. J., in preparation). 


C. SOX and the Exosome 


Regarding the mechanism of SOX function, the simplest explanation 
would be that SOX has evolved an RNase activity (similar to HSV vhs) 
in addition to its preexisting DNase activity. However, such an activity 
has yet to be consistently detected using a variety of in vitro systems 
(Glaunsinger and Ganem, unpublished observations). Thus, either 
SOX is an RNase but requires one or more cellular cofactors not 
present in the assay conditions examined thus far or, alternatively, 
SOX is not an RNase but instead modulates or redirects the activity of 
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Fic 5. Organization of the SOX protein and models for its function. (A) SOX is a 486- 
amino acid protein containing 7 motifs present in all herpesvirus homologs that are 
believed to be involved in the conserved AE function (black boxes), as well as a centrally 
located NLS. The location of single-function SOX mutations that retain either only the 
DNase activity (asterisks) or only the RNA turnover function (black circle) is indicated 
above. (B) Possible functions of the SOX—exosome interaction. The association of SOX 
with several members of the mammalian exosome suggests that this group of exo- 
nucleases may assist in host shutoff, although the means by which this might occur have 
yet to be determined. Possible models include activation of the exosomal enzymes and/or 
recruitment of the exosome to mRNAs by SOX (this scenario does not require an RNase 
activity for SOX itself), usage of the exosome as a vechicle for SOX to access cellular 
mRNAs (e.g., if SOX possessed RNase activity), or SOX-induced enhancement of exo- 
somal activity (which could occur whether or not SOX functions as an RNase). 


one or more cellular mRNA degradation enzymes. Intriguingly, SOX 
has been shown to interact with several members of the mammalian 
exosome (Glaunsinger, and Ganem, unpublished observations). These 
interactions occur even in RNase A-treated lysates, demonstrating 
that they are indeed protein-protein interactions rather than nonspe- 
cific associations occuring via RNA binding. Furthermore, binding 
reactions using recombinant proteins purified from E. coli revealed 
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that SOX binds at least one of these enzymes (PM/Scl-100) directly, 
and perhaps recruits other members of the exosomal machinery via 
indirect interactions. This preliminary data hints that the mechanism 
of SOX action may involve the appropriation of one or more cellular 
mRNA turnover pathways (Fig. 5B). Whether these interactions result 
in activation of the exonucleases, or SOX-mediated recruitment of the 
exosome to target mRNAs is unknown. Alternatively, if SOX indeed 
proves to have nuclease activity, such interactions may exist to direct 
SOX to the normal mRNA targets of the exosome—in this model, it is 
the exosome that is the targeting device, not SOX. 

An interaction between SOX and the PARN or hPan deadenylases 
has not been observed, which may suggest that SOX associates with a 
different deadenylase or can stimulate RNA degradation in the ab- 
sence of the prior deadenylation required in normal mammalian 
mRNA turnover (such a model might be applied if SOX indeed has 
nuclease function). Alternatively, the observation that several mem- 
bers of distinct mRNA degradation pathways have been shown to 
interact with each other could provide a means for SOX to simulta- 
neously recruit deadenylation and/or decapping enzymes via more 
indirect associations. It could also be envisioned to recruit the exosome 
to mRNAs already bound by deadenylase enzymes. Cytoplasmic SOX 
does not appear to localize to the punctate P bodies associated with 
cellular mRNA decay (Glaunsinger et al., 2005). Thus, it seems unlike- 
ly that SOX activity manifests at these sites, although it is not estab- 
lished whether any P body components are recruited elsewhere during 
KSHV lytic infection. Clearly, much remains to be learned about the 
biochemistry of SOX-mediated mRNA turnover. 


D. Escape from SOX-Induced Turnover 


Although KSHV lytic infection imparts a widespread shutoff to host 
gene expression, expression profiling of infected TIME cells revealed 
that a small group of cellular mRNAs remains upregulated throughout 
the KSHV lytic cycle and therefore must escape SOX-mediated degra- 
dation (Glaunsinger and Ganem, 2004a). These genes are anticipated 
to play key roles in the KSHV lifecycle. One such transcript encodes 
human IL-6, a factor whose expression is common to several KSHV- 
associated neoplasms, including multicentric Castleman’s disease and 
PEL (Asou et al., 1998; Oksenhendler et al., 2002). Another is HIF-1a, 
a transcription factor stabilized during hypoxia whose targets include 
a number of cellular factors frequently involved in tumorigenesis 
(Safran and Kaelin, 2003). Hypoxia can induce lytic reactivation of 
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KSHV and several viral lytic promoters contain hypoxia-responsive 
elements, perhaps indicating that HIF-la may be involved in the 
expression of select viral genes (Davis et al., 2001; Haque et al., 
2003). Transient co-transfection of SOX with each of the escapees 
reveals that these RNAs can evade SOX-mediated degradation even 
in uninfected cells, under conditions in which control RNAs are rapidly 
degraded (Glaunsinger and Ganem, 2004a). This suggests that cis- 
acting sequences (or secondary structures) in some cellular transcripts 
can rescue their RNA from the SOX-mediated degradative pathway. 

It is of note that nearly all of the host shutoff escapees possess AREs 
within their 3’ UTRs, perhaps suggesting that this element somehow 
mediates resistance to SOX activity (Glaunsinger and Ganem, 2004a). 
This theory is supported by the observation that addition of the IL-6 
3’ UTR to GFP confers stability to the GFP message in SOX-expressing 
cells (Glaunsinger and Ganem, unpublished data). In addition, our 
finding that SOX co-immunoprecipitates with the AUBP TTP in cells 
provides a tantalizing connection between these instability elements 
and SOX (Glaunsinger and Ganem, unpublished data). These observa- 
tions are also intriguing given the similar escape of ARE-containing 
messages from HSV-mediated host shutoff and the association be- 
tween HSV-1 vhs and TTP (Esclatine et al., 2004; Hsu et al., 2005; 
Taddeo et al., 2002). One possibility is that these viruses although 
possessing distinct effectors and mechanisms of imposing host shutoff 
have somehow evolved similar means of protecting select cellular 
transcripts. It is important to keep in mind, however, that there are 
numerous ARE-containing messages that do not escape KSHV- 
mediated host shutoff and are susceptible to SOX activity, indicating 
that if the ARE plays a role in resistance, it is not the sole factor 
mediating escape (Glaunsinger and Ganem, 2004a). 


VI. FUTURE DIRECTIONS AND CONCLUDING REMARKS 


Historically, viruses have proven to be excellent teachers of the 
biology of eukaryotic gene expression—virus-encoded regulators like 
HIV tat and rev have opened whole pathways of posttranscriptional 
control to experimental view. We believe that the herpesviral regula- 
tors of mammalian mRNA turnover have the potential for imparting 
similar lessons. Although our understanding of how the viral modula- 
tors of RNA turnover work is still fragmentary, we anticipate that 
the next few years will yield exciting advances in this field. In particu- 
lar, the connections between ARE-containing messages, ARE-binding 
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proteins, and viral effectors of host shutoff should be a major focus of 
research. Elucidation of the precise elements and proteins responsible 
for directing these viral effectors of host shutoff to their mRNA targets, 
as well as defining how select messages (including viral ones!) escape 
degradation are also issues of significant interest that will likely ad- 
vance our knowledge of targeted cellular mRNA decay as well. 
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receptors and entry, 217—228 
virion assembly interactions, 228-235 
virion, minimal set of structural 
proteins, 199 
virion morphology, structural and 
accessory proteins 
accessory proteins, 211-216 
envelope protein (E), 205—206 
genome, 210-211 
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DAN. See Deadenylation nuclease 
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Deletion mapping, 233 
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DNA 
binding proteins, 298, 372 
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DTT. See Dithiothreitol 
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E5, role in carcinogenesis, 133 
E6 and E7, 132-133 
mutants, 234 
E proteins, 109, 127—128, 205, 229 
E2 proteins from HPV16 strains, 132 
E5, E6, and E7 proteins, 129, 133-134, 
137-139 
E6 protein, coexpression of, 133 
E7 expression, 139 
E8 protein, 106 
E2F transcription factors, 127 
E3 ubiquitin ligase complex, 167 
E8R gene, 106 
EL, 35. See also VV-Cop-E9L; 
VACWRO065 
Ebola virus, 226 
EBV. See Epstein-Barr virus 
Ectocarpus siliculosus virus EsV-1, 
323, 326 
EDS1, 172, 174 
and EDS5, mutations in, 170-171 
EDS5, 172-173 
EF. See Elongation factor 
EF-3 proteins, 306 
EIF4A, helicase activity of, 363 
EIF4AII, 362-363 
EIF4A/eIF4G/elIF4E cap binding 
complex, 363 
EIF4B, 362-363 
EIF4E, 339-342, 348 
cap binding by, 343 
EIF4G, 305, 339, 355 
EIF4H, 362-363 
ELAV, 356 
Elongation factor (EF), 306 
EM autoradiography, 62 
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Emiliania huxleyi, 326 

Encapsidation, 236 

Endochitinase, 319 

Endocytosis, 225, 231 

Endonucleases, DNA site specific, 294 

Endoplasmic reticulum-Golgi 
intermediate compartment (ERGIC), 
216, 228-229, 231, 234 

Endoribonucleolytic decay, 348-351 

Envelope (E) protein. See also E proteins 

coronavirus, 203, 205—206 

Environmental Systems Research 
Institute, 13 

Enzymes, deadenylation 

cellular decapping enzymes, 339 

Epstein-Barr virus (EBV), 369-370 

Equine herpesvirus 1 (EHV-1), 360 

ER/Golgi Intermediate Compartment 
(ERGIC), 71 

ERGIC. See Endoplasmic reticulum-Golgi 
intermediate compartment 

ERK, 89 

ERK1, 94 

ERV/ALR protein, 307 

Erythroid cell enriched endonuclease 
(ErEN), 350 

Estrogen, 350 

Ethylene, 163 

ets52 mutant, 86 

Ewingelle americana, 318 

Exocytosis, 96 

Exonucleases, RNA and DNA, 
DEDD, 342 

Exonucleolytic decay pathways, 339 

Exoribonucleolytic decay and decapping, 
347-348 

Exosome, mammalian, 346, 

Extracellular virions (EV), 34 
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F10 kinase, 89 

F10 protein kinase, viroplasm, 82 
F10 protein, 72 

F10L gene, 73 

F17 protein, 94-95 

F17R gene, 95 

F9 protein, 103 
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formation of, 57—58 

FBP2. See K homology splicing regulatory 
protein 

Fibrillarin, 209 

FIPV mutants, 231, 258 

FIPV, 205, 222, 229 

FIPV S proteins, 222 

Flavivirus genome, 210 

Frameshifting, 247-250 

Freeze etch or deep etch electron 
microscopy, 43, 45 

Furin inhibitor, 228 
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G1 protein, 97 
GIL, 97 
G5 mutant infection, 87 
G5 protein, 75-76 
GSR gene, 75 
G5R ORF, 76 
G7 protein, 82-85 
defective, 84 
expression, 83 
GCoV ORFs, 213 
GD virus, 224 
GDP-4-keto-6-deoxy-D-mannose, 325 
epimerase/reductase (GMER), 324 
GDP-D-mannose 4,6 dehydratase (GMD), 
323-325 
GDP-L-fucose, 325 
Gene amplification, 316 
Gene expression, 163 
cellular, 337, 357 
a-Gene promoters, 366 
Gene rearrangement in chlorella viruses, 
patterns of 
gene amplification, 316 
gene replacements, 316-317 
large deletions/insertions, 315 
small deletions/insertions, 315-316 
Genes 
early, 35 
intermediate, 35 
late, 35 
Genome 
of coronaviruses, 210 
encapsidation, 87—90, 99 
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maturation, 87 

packaging, coronavirus, 235-237 
Genomic RNA (gRNA), 237 
Geographic information system (GIS) 

and geostatistics, 6-7 
GFP message, 376 
GFP reporter protein, 371-372 
GFP-ARE reporter, 355 
GIS. See Geographic information system 
Global positioning system (GPS), 6 
a-Globin, 354 

and TfR messages, 350 
-Globin, 354 

mRNA, 360 
-Globin-c-fos ARE reporter, 354 
-1,3-Glucan laminarin, 319 
-1,3-1,4-Glucan lichenan, 319 
-1,3-Glucanase, 317, 320 
16 endo-Z-1,3-Glucanases, 319 
endo--1,3-1,4-Glucanases, 319 
Glucosamine, 318 
Glucosamine synthase, 317 
p-Glucuronic acid (GlcA), 319 
Glutamine: fructose-6-phosphate 

amidotransferase, 321 

Glycosaminoglycans, 108 
G-1-4-Glycosyltransferase, 322 
Glycosyltransferase, 320-321 
GM-CSF, 352, 356 

increased levels of, 353 

stabilization of, 355 
GM-CSF ARE messages, degradation 

of, 355 

GPS. See Global positioning system 
Green alga, 293 

gene products of, 294 
“Green Revolution,” 8 
Groundnut ring spot virus, 176, 178 
GST1, 170 
GW bodies, 341 
GW182, 341 
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HI1L gene, 105—106 

H1 protein, 105—106 
H3 protein, 81, 100-101 
H3L gene, 100-101 

H4: rap94, 93-94 
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genetic analysis of, 93 
H5 
in DNA replication, 75 
mutant infection, 87 
H5 protein, 73, 75 
H5R gene, 73-75 
H6 topoisomerase, 104 
HA, 227 
HA and chitin, genetic differences, 
323, 328 
Hairpin loop, 309 
Hairpin RNA (hpRNA), 179 
Has gene, 321-322 
HAS. See Hyaluronan synthase 
hCCR4, 353 
HCF, 367 
HCoV-229, 221 
HCoV-229E, 227 
RBD, 223 
replicon RNA, 209 
receptor for, 222 
virions, 200 
HcoV-HKU1, 196, 251 
genome, 246 
HCoV-NL63, 196, 211, 222 
HCoV-OC43, 215 
HDepla, 353 
HE* and HE” viruses, 216 
HE gene, 215 
Heat shock protein (HSP), 175, 255 
hsp70, 355 
HSP90, 168 
Helicases, 304 
Helper virus, 8, 179, 235 
Hemagglutinin, 214 
Hemagglutinin-esterase (HE) protein, 
199, 212-216. See also E3 
Hep G2 cells, 343 
HEp-2, 365 
Hepatitis, 194 
Hepatotropism, 218 
Herpes simplex virus (HSV) 
infection, 364, 366-368 
replication, 368 
virions, 362 
type 1 (HSV-L), 130, 357-359, 
365, 370 
infection, 366 
mutants, 359 


vhs, 369 
vhs and TTP, 376 
vhs mutations to, 359 
Herpesvirus(es), 357—358, 370, 372 
a-, 361 
a- and 6-, 360 
a- and y-, 337-338 
a-Herpesvirus—induced mRNA decay, 
357-360 
Vhs as ribonuclease, 360-364 
ARE-containing cellular messages, 
degradation of, 364-366 
Vhs during HSV infection, regulation 
of, 366-368 
Vhs activity in vivo, 368-369 
+-Herpesvirus—induced mRNA decay, 
369-371 
7-Herpesvirus—induced host shutoff, 
effectors of, 371-373 
SOX and exosome, 373-375 
SOX-induced turnover, escape from, 
375-376 
Heterogeneous nuclear ribonucleoprotein 
Al (hnRNPA1), 254-255 
HEV, 214 
HFF cells, 365 
a-HIF-1, 376 
Histidine triad (HIT) pyrophosphatase 
motif, 347 
HIV. See Human immunodeficiency virus 
HIV gp41, 227 
Holliday resolvase, 90 
Host resistance, 162 
hPan deadenylases, 375 
HPV. See Human papillomaviruses 
HRrp4, 353 
HRT signaling pathway, 172 
HRT transgene, 170, 174 
HSki2w, 345 
HSP. See Heat shock protien 
HSV. See Herpes simplex virus 
HSV-2 vhs, 369 
HSVAE protein, 372 
Human immunodeficiency virus (HIV), 220, 
226 
infection, 370 
Human papillomaviruses (HPVs), 125 
and cervical cancer 
role of HPV in, 129-131 
pathogenesis of, 131-134 
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ued) 
prospects for antiviral treatments of, 138 
DNA types, 130 
E6 protein, 127 
E7 protein, 127 
genome, 139 
life cycle, 126-129 
oncogene products, 138 
plasmid DNA, 129 
proteins, 137 
vaccination 
prophylatic against high-risk HPV 
infection, 134-136 
therapeutic against cervical 
carcinoma, 136-138 
HuR, 352, 356 
HUVECs, 354 
HXXEH motif, 97 
Hyaluronan synthase (HAS), 321 
Hyaluronan synthase gene (has, A98R), 317 
Hyaluronan, 294, 321 
Hydra viridis, 296 
Hypersensitive response (HR), 162 
HR1 and HR2, 226, 228 


I protein, 212 
IkBa mRNAs, 365 
Il protein, 94 
I2L gene, 107 
I5 membrane protein, 109-110 
16, 88-90 
I6L gene, 89 
I7 protein, 97—98 

mutants, 99 
I7L, 97-98 
I8 RNA helicase, 104 
IBV. See Infectious bronchitis virus 
Icosahedral ribonucleoprotein capsids, 200 
ICP8, 372 
TEX-1, 365 
TEX-1 mRNA, degradation of, 366 
IFN-stimulated genes (ISG), 369 
IGF II, 349 
IL-1 

8 and TNFa mRNAs, stabilization 

of, 355 
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IL-3, 352, 356 
IL-6, human, 375 
IL-8, 369 
stabilization, 356 
Immature virion formation 
A10: p4a/4a, 86-87 
seven-protein complex, 82—86 
Immunoglobulin, 218 
Infectious bronchitis virus, avian 
(IBV), 194 
E protein, 205-206 
frameshifting, 247 
M protein, 204 
N protein, 208-209 
S protein, 202 
virions of, 235 
Influenza C virus, 215 
Influenza virus, 226—227 
Inoculum 
sources, 9 
wind-borne, 4 
Insect pest management, 7 
Integral membrane proteins, 52, 205 
Integral MV membrane proteins, 79 
Interferon a (IFN), 369 
Ion transport proteins, 327 
IPTG, 72, 77, 80, 84, 86 
inducerts, 85 
IRES (internal ribosome entry site), 305, 
361-363 
IRF-3, 369 
IRF-7, 369 
Iridoviruses, 295, 303 
Iscarcinoembryonic antigen (CEA) 
family, 218 
Isochorismate synthase, 170 
IV and MV formation, new models 
for, 64 
IV membrane biogenesis, 80 
IV to MV, morphogenesis from, 65 
IVN, 56-57 
IVN to MV, morphogenesis from, 64 


J 
J1 protein, 82, 85 


mutant infection, 87 
Jasmonic acid (JA), 163 
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JHM, 225 
JHM S protein, 202 
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K homology splicing regulatory protein 
(KSRP), 352, 355-356 
Kt channel (Kev), 294, 302, 324 
K562, 354 
human erythroleukemic cell line, 354 
Kaposi’s sarcoma (KS), 370 
Kaposi’s sarcoma-associated herpesvirus 
(KSHV), 369-373, 376 
infection, 371 
lytic infection, 375 
Kev gene, 316 
Kev. See K* channel 
Keratinocytes, 133 
“Kriging,” 4 
KRRSRR, 202 
KSHV genes, 371 
KSHV. See Kaposi’s sarcoma-associated 
herpesvirus 
KSRP. See K homology splicing regulatory 
protein 
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L1 protein, 100, 104, 129, 134 
recombinant L1 protein, 135 
L1R gene, 100 
L2 protein, 129 
L3 protein, 105 
L3L gene, 105 
L4 protein, 105 
LAR gene, 105 
“Lateral bodies”, 32-33, 47, 49, 54, 70 
Leaf discoloration, 9 
Leaf hopper, 9. See also Circulifer 
tenellus 
abundance, 22, 24 
dispersal, 11 
immigration, 22—23 
immigration and population 
development, seasonal patterns, 10 
plant-to-plant spread by, 10 
vectors, 9, 24 
adults and nymphs of, 12 
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Lettuce mosaic disease, 

spread of, 5 
Lettuce mosaic virus (LMV), 24 
Leucine-rich repeats (LRRs), 163 
Light trap, 11 
LMV. See Lettuce mosaic virus 
LRR. See Leucine-rich repeats 
L-SIGN, 221 
Lsm complex, 339 
Lycopersicum esculentum, 177 
L. hirsutum, 177 
L. peruvianum, 177 
Lysine di-methyltransferase, 324 


M 


M and E protein virion component, 
coexpression of, 229 
M glycoprotein, 203 
M protein(s). See Membrane proteins 
M7GpppN cap, 339 
Maternal homeodomain 
proteins, 349 
Mature virions (MV), 34 
biogenesis, 71 
formation, 90 
core proteins for transcriptional 
competence, 104—106 
IV to MV transition, 94-101 
membrane proteins affecting virus 
binding, entry and fusion, 
107-108 
morphogenesis arrests during 
nonpermissive infections, 74 
lipid composition of, 48 
maturation, 104 
particle surface restructuring, 
101-104 
transition, 99, 104 
transcription apparatus within core, 
assembly of, 91-94 
transport, occlusion and secondary 
wrapping, 108-109 
A26, 109 
A27, 108 
virion protein precursors, proteolysis 
of, 96-99 
wrapping of, 104 
Measles, 220 
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A17, 71 
coronavirus, 203—205 
Membrane biogenesis, 70, 73, 80 
Membrane enzymes, 51 
Membrane structural proteins, 51 
2-Mercaptoethanol, 48—49 
Metabolic inhibitors, 81 
Metalloproteases, 97 
Methionyl-tRNA synthetase, 325 
2'-O-Methyltransferase activity, 254 
MHV. See Mouse hepatitis virus 
MHV DI RNAs, 238-240 
MBHV E protein, 205—206 
MBV helper virus, 236 
MHV M protein, 204 
MHV mutants, 215, 231, 245, 256 
MBHV N protein, 207—209 
MHV RBD, 223 
MHV S protein, 219, 225, 227-228 
A59, 202 
endodomain of, 232 
mutations in, 226 
RBD of, 226 
MHV strain A59 mutant, 228 
Microscopy 
atomic force, 43, 45 
cryo-electron, 43, 45 
freeze etch or deep etch electron, 
43, 45 
Microtubules, 108 
Mimivirus, 295 325 
genome, 326 
MIP-1a, 369 
Mitochondrial associated membranes 
(MAM), 71 
Mitogen-activated protein kinases 
(MAPKs), 169 
Mitosis, 128, 345 
Mouse hepatitis virus (MHV), 194, 200 
infection, 216 
packaging, 236 
signal, 237 
receptors, 218-219 
reverse genetic approach for, 244 
virions, 232 
MPP6, 345 
MRBG1, 297 
mRNA 
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capping, 52 
decay, 337-338, 357, 362 
posttranslational, 341 
degradation enzymes, 341 
inactivation, 351 
and regulation in herpesviral infection, 
337 
stability, control of, 338 
stability complex, 354 
mRNA binding proteins, 350 
mRNA capping enzyme, 304 
mRNA turnover 
cellular, 370 
EBV-induced, 373 
enzymes, 353 
global, 363 
herpesviral regulators of, 376 
in infected cells, 358 
pathways of, 340 
regulated, 341 
SOX-induced, 372 
SOX-mediated, 375 
stability and translation, 339 
MRNP complex, 350 
MT325, 308, 314 
mtr3, 345 
Multicentric Castleman’s disease (MCD), 370 
Mutants, conditional lethal, 57 
Mutations, 134 
in EDS1, EDS5, PAD4, SID2 genes, 
170-171 
MV. See Mature virions 
Mycoplasma genitalium, 326 
Myeloid hyperplasia, 353 
Myxoviruses, 194 
Myzus persicae, 5 
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N gene, 207, 213 

translocation of, 240 
N protein, 168, 206-207, 216, 233 
NHS motif, 79 
N-Acetylglucosaminidase, 319 
NahG transgene, 170, 174 
Nanoarchaeum equitans, 326 
N-C64A, 297, 301-302, 308-309, 314, 

319-320 
genome, 316 
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suppressors of, 233 
viruses, 296, 317 


NCLDV See Nucleocytoplasmic large DNA 


viruses 
NDR1, 172, 174 
Nematodes, resistance to, 176 
NendoU activity, 254 
Nephotettix virescens, 9, 25 
Neurotropism, 218 
N-gene expression, 164 
Nidovirales, 195, 213, 238 
NIH 33, 354 
Nilaparvata lugens, 23 
Nocturnin, 341, 344 
Nonpersistent viruses, 3 
beet mosaic 6 
Nordihydroguiaretic acid, 71 
Not1-5, 343 
NPR1, 173-174 
mutation in, 172 
NPR1-like protein, 167 
NRG1, 169 
nsp13, 253 
nsp14, 253 
nsp 15, 253 
NSPPP, 89 
Nuclear inclusion protein, NIaPro, 176 
Nuclear localization signal (NLS), 372 
Nuclease mapping, 249 
Nucleocapsids, 32, 216, 229, 236 
symmetric, 200 
Nucleocapsid (N) protein, coronavirus, 
203, 206-210 
Nucleocytoplasmic large DNA viruses 
(NCLDV), 295, 324-326 
‘Nucleoids,” 50, 56, 65, 68 
Nucleolin, 209, 304 
Nucleoproteins, 88, 95, 178 
NY-2A, 308-309, 314 
genome, 315-317 
NC64A virus, 302 
NY-2A genes, 308 


octl, 367 
OIP2/Rrp43, 345 
Oleic acid, 172 
Oncogenes, 131, 134 


Open reading frame (ORF), 37, 205, 211, 
213, 313, 316, 359, 371 
encoded by PBCV-1, 310-312 
Ornithine decarboxylase, 324-325 
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P bodies, 339, 341, 375 
cytoplasmic, 351 
mammalian, Xrn1 in, 348 
p21, 356 
p37 and p42, overexpression of, 354 
p38 MAPK/MK2 pathway, 353 
p38/MK2 activation, 356 
p40, dephosphorylation of, 355 
p4b/4b protein, 96 
P53 tumor suppressor, 127, 138 
pablp, 344 
PABP. See poly A binding protein 
PAD4, 172, 174 
mutations in, 170-171 
“Palisade layer,” 46, 48—49, 54, 69 
Palm civets, 224 
Palmitoylation, 228 
Pan activity in yeast, 344 
Papaya ringspot virus (PRSV), 178 
Papillomas, 127 
Papillomavirus. See also Human 
papillomaviruses 
E1 and E2 proteins, 128 
E5, E6, and E7 proteins, 137 
proteins, 129 
PAPK sequences, 301 
Paramecium bursaria, 296 
Paramecium bursaria chlorella virus 
(PBCV), 295 
Paramecium bursaria chlorella virus 1 
(PBCV-1), 294, 297, 301, 304, 306, 324 
3 D-image reconstruction of, 300 
antigenic variants, 320 
capping enzymes, 303 
cryo-electron microscopy density maps 
of, 299 
and CVK2 terminal hairpin ends, 
nucleotide sequences at, 309 
and CVK2, gene arrangement between, 
314 
gene expression, 327 
gene products, 313 
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Paramecium bursaria chlorella virus 1 
(PBCV-1) (continued) 
genome, 293, 303, 307, 313, 315, 327 
replication, 317, 320 
representative ORFs encoded by, 
310-312 
virion, 302 
Parcelliary maps, 13 
PARN. See Poly(A)-specific exoribonuclease 
“Pathogen-derived resistance,” 177-178 
Pathogenesis-related (PR) genes, 166-167 
PBCV. See Paramecium bursaria chlorella 
virus 
PBCV-1. See Paramecium bursaria 
chlorella virus 1 
PBCV-1 A140/145R, 301 
PBCV-1 A181R/A182R protein, 319 
PBCV-1 DNA, 302 
PBCV-1 genes, 317 
PBCV-1 ORF, 322 
PBCV-1 RNA synthesis, 304 
Pbi viruses, 296, 297, 308, 314 
PB-SW1, 297 
PCoV ORF, 213 
PDF1.2 genes, 173 
PEDV, 211, 222 
Pepper mild mottle virus (PMMV), 179 
Peptidoglycan synthesis, 321 
Peronospora parasitica, 169 
PFU. See Plaque-forming units 
Phaeocystis pouchetii (PpV), 326 
Phaeovirus, 295 
Phenylalanine ammonia lyase, 166 
Phosphatidyl ethanolamine, 48 
Phosphoinositide phosphates, 72 
Phosphoproteins, 207 
Phosphorylation, 208 
Photoperiod, 8 
Phycodnaviridae, 293—294 
Phycodnaviruses, 327 
evolutionary history of, 324 
Phytoalexin, 170 
Phytohormones, 163 
Picornavirus genome, 210 
Plant pathogens, conventional protection 
methods, 162 
Plant resistance to viruses. See also Viral 
resistance in plants 
signal transduction and defense against 
viral pathogens, 161-163 
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R gene-mediated resistance to viral 
pathogens, 164-177 
viral resistance, 177—180 
Plant virus diseases 
economic importance of, 2-3 
spreading between fields 
field-to-field, 5—6 
geographic information systems and 
geostatistics, 6-7 
long distance, 4—5 
spreading within plantings, 3—4 
Plaque-forming units (PFU), 302 
Plasma membrane associated membranes 
(PAM), 71 
Plum pox virus, systemic resistance 
to, 179 
PM/Scl-100, 345 
PM/Scl-75, 345, 353 
PNIH 3T3 cells, 343 
Pol I, 361 
Poliovirus, 244 
IRES, 361 
Poly A binding protein (PABP), 339, 342, 
344, 348, 350, 355 
Poly C binding proteins, 350 
Poly(A) nuclease complex (hPan2/Pan3), 
341, 343 
Poly(A) polymerase, 35, 93, 305 
Poly(A)-specific exoribonuclease (PARN), 
341, 343-344, 353, 356 
cDNA, 342 
phosphorylation, 343 
and SOX, 375 
Polyadenylation, 52 
Polyamine biosynthesis, 327 
Polymyositis-scleroderma overlap 
syndrome, 345 
Polynucleotide phosphorylase (PNPase), 
345 
Polysaccharide biosynthesis, 327 
Polysaccharide lyase, 317-320 
Polysaccharide synthesis, viral encoded 
enzymes in, 323 
Polysomal ribonuclease I (PMR1), 350, 363 
Pop2p/Caf 1p, 343 
Posttranscriptional gene silencing 
(PTGS), 178 
Potato mop-top virus (PMTV), 178 
Potato virus X (PVX), 163, resistance 
in potato 
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Nb-mediated, 175 
Rx- and Rx2-mediated, 175 
Poxviridae, 33, 34 
Poxvirus(es), 32, 295, 303-304, 313 
biology and replication, 33-35 
genetics, 36, 41-42 
genome, 35, 315 
comparison of, 34 
membrane growth, model for, 69 
virions 
extracellular virions (EV), 34 
mature virions (MV), 34 
morphogenesis and structure 
of, 32-33 
wrapped virions (WV), 34 
PR-1 and PR-5, 173 
PR-1, PR-2, PR-5, 170 
Prasinovirus, 295 
Primary effusion lymphoma (PEL), 
370, 375 
PRM1, 350 
Prolyl 4-hydroxylase, 307 
Proteasome, 167 
Protein disulfide isomerase, 307 
Protein kinases, 298 
Proteinase K, 360 
Proteinvirus proteins, 64 
Proteolysis 
of core proteins, 98—99 
of vaccinia core, 84 
PRV. See Pseudorabies virus 
Prymnesiovirus, 295 
Pseudomonas syringae pv. tomato, 169 
Pseudorabies virus (PRV), 359-360 
Pseudovirions, 225 
PSORT, 320 
PSSP motifs, 94 
PTGS, 179 
PVX. See Potato virus X 
PVY resistance in potato, Ry- and 
Nytbr-mediated, 175-176 
Pyramimonas orientalis (PoV-01B), 326 
Pyrimidine tract-binding protein 
(PTB), 255 
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R gene-mediated resistance to viral 
pathogens, 164 


dominant R genes against plant 
viruses, 165 
N gene-mediated resistance to Tobacco 
mosaic virus in tobacco, 164-169 
HRT-mediated resistance to Turnip 
crinkle virus in Arabidopsis, 
169-172 
RCY1-mediated resistance to Cucumber 
mosaic virus in Arabidopsis, 
172-174 
RTM1-mediated resistance to Tobacco 
etch virus in Arabidopsis, 174-175 
Rx- and Rx2-mediated resistance to 
potato virus X in potato, 175 
Nb-mediated resistance to PVX in 
potato, 175 
Ry- and Nytbr-mediated resistance to 
Potato virus Y in potato, 175-176 
Sw-5-mediated resistance to Tomato 
spotted wilt virus in tomato, 176 
Tm-22-mediated resistance to Tomato 
mosaic virus in tomato, 176-177 
Rsvi1-mediated resistance to Soybean 
mosaic virus in soybean, 177 
L locus-mediated resistance to 
tobamoviruses in pepper, 177 
R proteins, 162, 176 
CC-NBS-LRR type, 167, 170, 173 
NBS-LRR type, 164 
TIR, 167 
Rabbit reticulocyte lysates (RRL), 360, 362 
RAD2, 361 
Rap94, 91-92 
Raphidovirus, 295 
RAR1, 171-172 
RAR 1-like protein, 167—169 
Rb tumor suppressor pathway, 138 
RCoV and SDAV, 220 
RCY1 transgene, 174 
RCY1, 173 
RdRps, 180 
Receptor angiotensin-converting enzyme 
2 (ACE2), 220 
Receptor-binding domains (RBDs), 
223-224 
Receptors and entry, coronavirus, 
receptor recognition, 222-224 
receptors, 217—222 
S protein conformational change and 
fusion, 224—228 
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Replicase gene, protein products of, 250 
Replicase polyprotein, 216 
Reporter mRNAs, 361 
Reporter RNAs, degradation of, 362 
Resistance protein(s) 
TIR-NBS-LRR 
CC-NBS-LRR, 169 
Resistance-signaling pathways 
HRT-mediated, 168 
N-mediated, 168 
RCY1-mediated, 168 
Retinoblastoma (Rb) tumor suppressor, 
127 
Retroviruses, 246 
Rhinoviruses, 220 
Ribosomal frameshifting 
coronavirus, 246-250 
RNA elements for, 247 
Ribozymes, 138 
Rice green leaf hopper, 9 
Rice tungro bacciliform virus (RTBV), 8-9 
Rice tungro diseases 
biology, 8-9 
economic importance, 7 
economic threat, 7—8 
in Philippines, case study, 11-26 
contol of, 23 
infection pattern between two 
successive sampling periods, 
16, 20 
mean peak incidence and frequency of 
planting, 14-15, 22-23 
source distance, vector abundance 
and field vulnerability, 16-19 
vulnerability of fields, 14-19 
spreading, within-field spatial patterns, 
9-10 
tungro vectors 
flight characteristics of, 10—11 
source distance, abundance, and field 
vulnerability, 16, 22 
Rice tungro spherical virus (RTSV), 8-9 
disease, new infections of, 1-2 
Rifampicin, 66, 81-85, 98 
reversal, 67 
RNA and DNA exonucleases, DEDD 
superfamily, 342 
RNA binding, 237 
RNA binding domains, S1/KH, 345-346 
RNA binding motifs, 356 
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RNA binding proteins, 339 
ELAV family of, 356 
GW182, 341 
RNA capping, 254 
RNA genome, 178 
RNA guanylyltransferase, 303-304 
RNA helicase, 345 
RNA polymerase, 35, 303 
inhibitors, 209 
RNA polymerase III promoter, 307 
RNA recognition motif (RRM), 355-356 
RNA recombination, 257—258 
RNA shutoff activity, 373 
RNA silencing, 162, 178-180, 357 
RNA synthesis, 240, 242, 244-245 
RNA synthesis, coronavirus 
replication and transcription, 237-243 
RNA recombination, 243—246 
replicase complex 
ribosomal frameshifting, 246-249 
replicase proteins, 249-254 
host factors, 254—256 
RNA triphosphatase, 303 
RNA viruses, 8, 194, 200, 226 
RNA-dependent RNA polymerase (RdRP), 
165, 253 
RNAi pathway, 351 
RNA-induced silencing complex 
(RISC), 351 
RNase, 373 
RNase D motifs, 342, 345 
RNasin, 360 
Roniviridae, 195 
RPP8, 173 
RPS2, 169 
RPW8, 171 
in Arabidopsis, 163 
RRADR, 228 
RRAHR, 202 
wild-type, 228 
RRL, 362 
rRNAs, 344 
Rrp, 345-346 
RRT, 170 
Rsv1, 177 
RTBV See Rice tungro bacciliform virus 
RTM1, 174 
RTM2, 175 
RTSV See Rice tungro spherical virus 
Ry gene, 176 
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S glycoprotein, 201 
S proteins, 201, 215, 228, 231 
binding receptor function of, 221 
CCoV, 222-223 
ectodomain, 202 
swapping of, 218 
endodomain, 232 
and M proteins, coimmunoprecipitation 
of, 230 
maturation, 202 
and receptor, 217 
of TGEV, 222-223 
S1/KH RNA binding domains, 345-346 
Saccharopolyspora (Streptomyces) 
erythraeus, 318 
SA-induced protein kinase (SIPK), 169 
Salicylate hydroxylase, 170 
Salicylic acid (SA), 163 
sarlp, 71 
SARS, adaptation of, 217 
SARS-CoV, 196, 198, 207, 211, 220-222, 
225, 227 
8a protein, 212-213 
E protein, 206, 235 
genome, frameshifting region of, 249 
infection, 221 
RBD, 223 
RdRp, 253 
replicase, 249 
S protein, 202, 220, 225, 227 
S protein RBD, 224 
virion assembly, 229 
sat-RNAs, 179 
Scavenger decapping activity (DcpS), 347 
SCF-E3 ubiquitin ligase family, 307 
SDAY, 220 
Seed-borne infection, 3 
Semipersistent viruses, 3, 9 
Senescence, 125 
Sequiviridae, 8 
Severe acute respiratory syndrome 
(SARS), 193-194 
Sexually transmitted disease, 136 
SgRNAs, 242 
synthesis, 244 
SGT1, 172 
SGT1b, 171 
SGT1-like protein, 167-169 
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Short interfering RNAs (siRNAs), 178 
SID2, 172 
mutations in, 170 
Signal transducers, 167 
Signal transduction and defense against 
viral pathogens in plants, 161-180 
Signalosome, 168 
SiRNAs, 351 
antisense strand of, 179 
Skin cancers, 130 
skp1 protein, 307 
“Slippery sequence,” 247 
Smallpox, 32 
SMIF, 348 
“Smooth layer,” 46, 49, 54, 69 
SnoRNAs, 344 
SnRNAs, 344 
SOX (shutoff and exonuclease), 371-373 
activity, resistance to, 376 
cytoplasmic, 375 
and exosome, 373-375 
induced turnover, escape from, 
375-376 
and PARN, 375 
SOX protein, organization and models for 
its function, 374 
Soybean mosaic virus (SMV), G1-G7 
strains, 177 
Soybean mosaic virus resistance in 
soybean, Rsvl-mediated, 177 
Spermidine, 47 
Spermine, 47 
Spermine zinc finger protein (ZFT1), 167 
“Spicule layer,” 54, 66, 81-82, 101 
Spike Protein (S), coronavirus, 201—203 
Splicing, 354 
SRPa RNA, 361 
ssi2 plants, 172 
STE. See Surface tubule elements 
Stearoyl-acyl carrier protein desaturase, 
171 
Structural proteins, 228 
membrane-bound, 216 
mutants, 256 
Subgenomic RNAs (sgRNAs), 238 
“Subnucleoids,” 50 
Sugar beet, 25 
Sugar beet yellows, 3 
Sulfolobus solfataricus, 325 
Suppressors of RNA silencing, 180 
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Surface tubule elements (STEs), 43, 49, 
101, 104 

Sweep net, 12 

SYNCRIP, 255 

Syncytia formation, 227-228 

Syngen 2-3, 297 

Systemic acquired resistance (SAR), 174 


T 
T cells, 368 
E6- and E7-specific, 137 
T2141, 363 


TAP peptide transport, 368 
TCV. See Turnip crinkle virus 
Telomerase, expression of, 133 
Tetracycline, 72, 80 
TEV, 178 
CP, 179 
TFIIB, 303-304 
TFIID, 303 
TFIS, 303 
TfR and a-globin messages, 350 
TGEV, 199-200, 205, 209, 218, 222, 225 
E protein, 205-206 
N protein, 208 
packaging signal, 236 
RBD, 223 
receptor, 221 
reverse genetic approach for, 244 
S protein, 202 
virions, 232, 235 
Thiol oxyreductase, 100 
THP-1 leukemia cells, TPA treatment 
of, 355 
Thrips, 6 
TIA-1, 365 
TIA-1/TIAR, 355, 366 
TIAR, 365 
Tillering, 12 
TIP (TCV interacting protein), 174 
TIR-NBS-LRR protein, 164 
Tissue tropism, 221 
Tm-2 and Tm-22 genes, 177 
TMV. See Tobacco mosaic virus 
TNFa and IL-18 mRNAs, stabilization 
of, 355 
TNFa, 352, 356 
increased levels of, 353 


Tobacco etch virus (TEV), 174 
resistance in Arabidopsis, 
RTM1-mediated, 174-175 
Tobacco mosaic virus 
CP gene, 178 
resistance in tobacco, 168 
N gene-mediated, 164-169 
Tobacco ringspot virus (TRSV), 178 
Tobacco, transgenic, 166 
Tobamoviruses resistance in pepper, L 
locus-mediated, 177 
Toll-interleukin-1 receptor (TIR), 163 
Tomato chlorotic spot virus (TCSV), 
176, 178 
Tomato diseases, 6 
Tomato mosaic virus (ToMV), 6 
resistance in tomato, Tm-22-mediated, 
176-177 
Tomato R gene, 176 
Tomato spotted wilt virus (TSWV), 176 
resistance in tomato, Sw-5-mediated, 176 
Topoisomerase, 93 
TOR2 virus, 224 
Toroviruses, 195-196, 213 
Tospoviruses, 176 
Transcription, discontinuous negative- 
strand, model for, 243 
Transcriptional repressors, 343 
cellular, 132 
Transcription-regulating sequences 
(TRSs), 241-242, 244, 255 
Transferrin receptor (TfR), 349 
Transgene 
HART, 170-171 
nahG, 170 
Transgenic tobacco, 166 
Transposases, 308 
Trap, light, 11 
Tristetraprolin (TTP), 352-354, 
365-366, 376 
TRNA genes, 307, 313 
TsA3 mutants, 96 
TsA30 mutants, 85 
TsA30 protein, 83 
TsF10 
infections, 83 
mutants, 75, 86 
TsG7 
mutants, 85 
protein, 85 
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TsJ1 protein, 85 
TSWV nucleocapsid protein, 178 
TTP phosphorylation, p38 MAPK/MK2- 
mediated, 353 
TTP. See Tristetraprolin 
Tumor suppressor genes, 134 
Tumor viruses, study of, 125 
Tumorigenesis, 375 
Tungro disease, 8, 14 
spatial patterns of, 11 
Tungro infection in rice 
control of, 23 
fields vulnerability, 14-19 
incidence categories, 13-19 
ordinal regression model for, 17 
infection patterns, 14-19 
between two successive sampling 
periods, 20 
source distance and field 
vulnerability, 16-19 
Tungro infection, symptoms of, 12 
Tungro symptoms, 9 
Tungro vectors, dispersal range of, 11 
Tungro virus disease in rice, new 
infections of, 1—2 
Tungro viruses, 9, 10 
retention period of, 24 
Tunicamycin, 202 
Turnip crinkle virus (TCV), 163 
resistance, 172, 168 
HRT-mediated, 169-172 
Tyr phosphatase, 307 
Tyr phosphorylation, 77 
Tyrosyl-tRNA synthetase, 325 
Tyr-protein kinase, 307 
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Ubiquitin, 307-308, 355 
ligase complex, 167 
UDP-glucose dehydrogenase (ugdh), 317, 
321-322 
Uganda, cassava mosaic disease epidemic 
of, 2 
UL12, 372 
UL41, vhs mutations to, 359 
Undecaprenolphosphate, 321 
Urokinase, 356 
receptor, 356 
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Vaccinia, 32-33 
assembly, 54 
coded redox system, 102 
conditional lethal mutants, 36, 
41-42, 57 
core and membrane proteins, 
proteolysis of, 84 
crescent, 70 
factories, 54 
genetic nomenclature, 35-36 
genome, 49-50, 97 
internal features, 46 
morphogenes, 54-57, 70 
genetics, 59, 62 
inhibition of, 81 
intermediates, 56 
stages of, 57 
surface features, 44 
viral membrane biogenesis, 67 
virion proteins, 37—40 
genetic nomenclature, 35-36 
virion structure 
determinants of, 86 
model for, 53 
substructures, 49 
Vaccinia RNA polymerase, 91 
Vaccinia virion MV 
chemical composition, 47—48 
controlled degradation, 48—49 
core genome structure, 49-50 
proteins, 50-52 
structure 
imaging studies of, 42—47 
model for, 52-54 
Vaccinia virion 
IV-—IVN transition, 87 
core proteins, 89-90 
membrane proteins, 88-89 
Vaccinia virus disulfide bond formation 
pathway, 102 
VACWR065, 36 
Varicella-zoster virus (VZV), 360 
Variola, 32-33 
VChta-1 gene, 318-320 
Vel and Ve2 in tomato, 163 
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arthropod, 3 
viruliferous, 3 
Vegetative propagules, 3 
VEGF, 356 
Vero cells, 199, 223 
E6 cells, 220 
VETF, 91-92 
genetic analysis of, 92-93 
mutants, 93 
Vhs, 362, 366 
activity in vivo, 368-369 
during HSV infection, regulation 
of, 366-368 
as ribonuclease, 360-364 
targeting to mRNAs, model for, 364 
Vhs 
mutants, 363 
null virus, 367 
proteins, 361 
Viral DNA, 99 
replication, 35, 87, 127-128 
sites of, 54, 62 
“Viral factories,” 54 
Viral gene(s) 
expression, 228, 358-360, 366 
silencing, 178—180 
Viral genomes, 326 
Viral mRNAs, 358 
synthesis, 52 
Viral oncogenes, 133 
Viral pathogens 
NBS-LRR category, 164 
in plants, signal transduction and 
defense against, 161-180 
Viral proteins, 50, 68, 234 
Viral replication, 306, 315 
cycle, coronavirus, 216 
Viral resistance in plants 
pathogen-derived resistance, 177-178 
viral gene silencing, 178-180 
Viral RNA, 138 
Viral RNA polymerase, 93 
Viral RNA-dependent RNA polymerase, 
196 
Viral satellite RNAs (sat-RNAs), 178 
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